(B) LOCATION: 1..4374

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

ATG GAA ATA GAG CTC TCC ACC TGC TTC TTT CTG TGC CTT TTG CGA TTC 48
Met Glu Ile Glu Leu Ser Thr Cys Phe Phe Leu Cys Leu Leu Arg Phe
1 5 10 15

TGC TTT AGT GCC ACC AGA AGA TAC TAC CTG GGT GCA GTG GAA CTG TCA 96 
Cys Phe Ser Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser 
20 25 30 

TGG GAC TAT ATG CAA AGT GAT CTC GGA GAG CTG CCT GTG GAC GCA AGA 144 
Trp Asp Tyr Met Gln Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg
35 40 45 

TTT CCT CCT CGC GTG CCA AAA TCT TTT CCA TTC AAC ACC TCA GTC GTG 192 
Phe Pro Pro Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val Val 
50 55 60 

TAC AAA AAG ACT CTG TTT GTA GAA TTC ACG GTT CAC CTT TTC AAC ATC 240
Tyr Lys Lys Thr Leu Phe Val Glu Phe Thr Val His Leu Phe Asn Ile
65 70 75 80 

GCT AAG CCA AGG CCA CCC TGG ATG GGT CTG CTA GGT CCT ACC ATC CAA 288
Ala Lys Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr Ile Gln
85 90 95 

GCT GAG GTT TAT GAT ACA GTG GTC ATT ACA CTT AAG AAC ATG GCT TCC 336 
Ala Glu Val Tyr Asp Thr Val Val Ile Thr Leu Lys Asn Met Ala Ser
100 105 110 

CAT CCT GTC TCC CTT CAT GCT GTT GGT GTA TCC TAC TGG AAA GCT TCT 384
His Pro Val Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser
115 120 125 

GAG GGA GCT GAA TAT GAT GAT CAG ACC AGT CAA AGG GAG AAA GAA GAT 432 
Glu Gly Ala Glu Tyr Asp Asp Gln Thr Ser Gln Arg Glu Lys Glu Asp
130 135 140 

GAT AAA GTC TTC CCT GGT GGA AGC CAT ACA TAT GTC TGG CAA GTC CTG 480
Asp Lys Val Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gln Val Leu
145 150 155 160 

AAA GAG AAT GGT CCA ATG GCC TCC GAC CCA CTG TGC CTT ACC TAC TCA 528
Lys Glu Asn Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser
165 170 175 

TAT CTT TCT CAT GTG GAC CTG GTT AAA GAC TTG AAT TCA GGC CTC ATT 576
Tyr Leu Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu Ile
180 185 190 

GGA GCC CTA CTA GTA TGT AGA GAA GGG AGT CTG GCC AAG GAA AAG ACA 624 
Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr 
195 200 205 

CAG ACC TTG CAC AAA TTT ATA CTA CTT TTT GCT GTA TTT GAT GAA GGG 672
Gln Thr Leu His Lys Phe Ile Leu Leu Phe Ala Val Phe Asp Glu Gly
210 215 220 

AAA AGT TGG CAC TCA GAA ACA AAG AAC TCC CTC ATG CAA GAT AGG GAT 720
Lys Ser Trp His Ser Glu Thr Lys Asn Ser Leu Met Gln Asp Arg Asp
225 230 235 240 

GCT GCA TCT GCT CGG GCC TGG CCT AAA ATG CAC ACA GTC AAT GGT TAT 768 
Ala Ala Ser Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr 
245 250 255 

GTA AAC AGG AGC CTG CCA GGA CTG ATT GGA TGC CAC AGG AAA TCA GTC 816 
Val Asn Arg Ser Leu Pro Gly Leu Ile Gly Cys His Arg Lys Ser Val
260 265 270 

TAT TGG CAT GTT ATA GGA ATG GGC ACC ACT CCT GAA GTG CAC TCA ATA 864 
Tyr Trp His Val Ile Gly Met Gly Thr Thr Pro Glu Val His Ser Ile
275 280 285 

TTC CTC GAA GGA CAC ACA TTT CTT GTT AGA AAC CAT CGC CAG GCG TCC 912 
Phe Leu Glu Gly His Thr Phe Leu Val Arg Asn His Arg Gln Ala Ser
290 295 300 

TTG GAA ATC TCG CCA ATA ACT TTC CTT ACT GCT CAA ACA CTC CTC ATG 960
Leu Glu Ile Ser Pro Ile Thr Phe Leu Thr Ala Gln Thr Leu Leu Met
305 310 315 320 

GAC CTT GGA CAG TTT CTA CTG TTT TGT CAT ATC TCT TCC CAC CAA CAT 1008 
Asp Leu Gly Gln Phe Leu Leu Phe Cys His Ile Ser Ser His Gln His
325 330 335 

GAT GGC ATG GAA GCT TAT GTC AAA GTA GAC AGC TGT CCA GAG GAA CCC 1056 
Asp Gly Met Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro 
340 345 350 

CAA CTA CGA ATG AAA AAT AAT GAA GAA GCG GAA GAC TAT GAT GAT GAT 1104 
Gln Leu Arg Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp
355 360 365 

CTT ACC GAT TCT GAA ATG GAT GTG GTC AGA TTT GAT GAT GAC AAC TCT 1152 
Leu Thr Asp Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser 
370 375 380 

CCT TCC TTT ATC CAA ATT CGC TCA GTT GCC AAG AAG CAT CCT AAA ACT 1200
Pro Ser Phe Ile Gln Ile Arg Ser Val Ala Lys Lys His Pro Lys Thr
385 390 395 400 

TGG GTA CAT TAC ATT GCT GCT GAA GAG GAG GAC TGG GAC TAT GCT CCC 1248
Trp Val His Tyr Ile Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro
405 410 415 

TTA GTC CTC GCC CCC GAT GAC AGA AGT TAT AAA AGT CAA TAT TTG AAC 1296
Leu Val Leu Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gln Tyr Leu Asn
420 425 430 

AAT GGC CCT CAG CGG ATT GGA AGG AAG TAC AAA AAA GTC CGA TTT ATG 1344 
Asn Gly Pro Gln Arg Ile Gly Arg Lys Tyr Lys Lys Val Arg Phe Met
435 440 445

GCA TAC ACA GAT GAA ACC TTT AAG ACT CGT GAA GCT ATT CAG CAT GAA 1392
Ala Tyr Thr Asp Glu Thr Phe Lys Thr Arg Glu Ala Ile Gln His Glu
450 455 460 

TCA GGA ATC TTG GGA CCT TTA CTT TAT GGG GAA GTT GGA GAC ACA CTG 1440
Ser Gly Ile Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu
465 470 475 480 

CTC ATT ATA TTT AAG AAT CAA GCA AGC AGA CCA TAT AAC ATC TAC CCT 1488
Leu Ile Ile Phe Lys Asn Gln Ala Ser Arg Pro Tyr Asn Ile Tyr Pro
485 490 495 

CAC GGA ATC ACC GAT GTC CGT CCT TTG TAT TCA CGC AGA TTA CCA AAA 1536
His Gly Ile Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys
500 505 510 

GGA GTA AAA CAT TTG AAG GAT TTT CCA ATT CTG CCC GGA GAA ATA TTC 1584 
Gly Val Lys His Leu Lys Asp Phe Pro Ile Leu Pro Gly Glu Ile Phe
515 520 525 

AAA TAT AAA TGG ACA GTG ACT GTA GAA GAT GGG CCA ACT AAA TCA GAT 1632
Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp
530 535 540

CCT CGG TGC CTG ACC CGC TAT TAC TCT AGT TTC GTC AAT ATG GAG AGA 1680
Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg
545 550 555 560 

GAT CTA GCT TCA GGA CTC ATT GGC CCT CTC CTC ATC TGC TAC AAA GAA 1728
Asp Leu Ala Ser Gly Leu Ile Gly Pro Leu Leu Ile Cys Tyr Lys Glu
565 570 575 

TCT GTA GAT CAA AGA GGA AAC CAG ATA ATG TCA GAC AAG AGG AAT GTC 1776
Ser Val Asp Gln Arg Gly Asn Gln Ile Met Ser Asp Lys Arg Asn Val
580 585 590 

ATC CTG TTT TCT GTA TTT GAT GAG AAC CGA AGC TGG TAC CTC ACA GAG 1824
Ile Leu Phe Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu
595 600 605 

AAT ATA CAA CGC TTT CTC CCC AAT CCC GCT GGA GTG CAG CTT GAG GAT 1872
Asn Ile Gln Arg Phe Leu Pro Asn Pro Ala Gly Val Gln Leu Glu Asp
610 615 620 

CCA GAG TTC CAA GCC TCC AAC ATC ATG CAC AGC ATC AAT GGC TAT GTT 1920
Pro Glu Phe Gln Ala Ser Asn Ile Met His Ser Ile Asn Gly Tyr Val
625 630 635 640 

TTC GAT AGT TTG CAG TTG TCA GTT TGT TTG CAT GAA GTA GCA TAC TGG 1968 
Phe Asp Ser Leu Gln Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp
645 650 655 

TAC ATT CTA AGC ATT GGA GCA CAG ACT GAC TTC CTT TCT GTC TTC TTC 2016
Tyr Ile Leu Ser Ile Gly Ala Gln Thr Asp Phe Leu Ser Val Phe Phe
660 665 670

TCT GGA TAT ACC TTC AAA CAC AAA ATG GTC TAT GAA GAC ACA CTC ACC 2064
Ser Gly Tyr Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr
675 680 685 

CTA TTC CCA TTC TCC GGA GAA ACT GTC TTC ATG TCG ATG GAA AAC CCA 2112 
Leu Phe Pro Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro 
690 695 700 

GGA CTA TGG ATT CTG GGG TGC CAC AAC TCA GAC TTT CGG AAC AGA GGC 2160
Gly Leu Trp Ile Leu Gly Cys His Asn Ser Asp Phe Arg Asn Arg Gly
705 710 715 720 

ATG ACC GCC TTA CTG AAA GTT TCC AGT TGT GAC AAG AAC ACT GGA GAT 2208
Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp
725 730 735 

TAT TAC GAG GAC AGT TAT GAA GAT ATT TCA GCA TAC TTG CTG AGT AAA 2256
Tyr Tyr Glu Asp Ser Tyr Glu Asp Ile Ser Ala Tyr Leu Leu Ser Lys
740 745 750 

AAC AAT GCC ATT GAA CCA AGA AGC TTC TCC CAG AAC CCA CCA GTC TTG 2304
Asn Asn Ala Ile Glu Pro Arg Ser Phe Ser Gln Asn Pro Pro Val Leu
755 760 765 

AAA CGC CAT CAA CGG GAA ATA ACT CGT ACT ACT CTT CAA TCA GAT CAA 2352
Lys Arg His Gln Arg Glu Ile Thr Arg Thr Thr Leu Gln Ser Asp Gln
770 775 780 

GAG GAA ATT GAC TAT GAT GAT ACC ATA TCA GTT GAA ATG AAG AAG GAA 2400
Glu Glu Ile Asp Tyr Asp Asp Thr Ile Ser Val Glu Met Lys Lys Glu
785 790 795 800 

GAT TTC GAC ATT TAT GAT GAG GAT GAA AAT CAG AGC CCC CGC AGC TTT 2448
Asp Phe Asp Ile Tyr Asp Glu Asp Glu Asn Gln Ser Pro Arg Ser Phe
805 810 815 

CAA AAG AAA ACA CGA CAC TAT TTT ATT GCT GCA GTG GAG AGG CTC TGG 2496
Gln Lys Lys Thr Arg His Tyr Phe Ile Ala Ala Val Glu Arg Leu Trp
820 825 830 

GAT TAT GGG ATG AGT AGC TCC CCA CAT GTT CTA AGA AAC AGG GCT CAG 2544
Asp Tyr Gly Met Ser Ser Ser Pro His Val Leu Arg Asn Arg Ala Gln
835 840 845 

AGT GGC AGT GTC CCT CAG TTC AAG AAA GTA GTA TTC CAG GAA TTT ACC 2592
Ser Gly Ser Val Pro Gln Phe Lys Lys Val Val Phe Gln Glu Phe Thr
850 855 860 

GAT GGC TCC TTT ACT CAA CCC TTA TAC CGT GGA GAA CTA AAT GAA CAT 2640
Asp Gly Ser Phe Thr Gln Pro Leu Tyr Arg Gly Glu Leu Asn Glu His
865 870 875 880 

TTG GGA CTC CTG GGG CCA TAT ATA AGA GCA GAA GTT GAA GAT AAT ATC 2688
Leu Gly Leu Leu Gly Pro Tyr Ile Arg Ala Glu Val Glu Asp Asn Ile
885 890 895

ATG GTT ACC TTC AGA AAT CAG GCC TCT CGT CCC TAT TCC TTC TAT TCT 2736
Met Val Thr Phe Arg Asn Gln Ala Ser Arg Pro Tyr Ser Phe Tyr Ser
900 905 910 

TCC CTC ATA TCA TAT GAG GAA GAT CAG AGG CAA GGA GCA GAA CCT AGA 2784
Ser Leu Ile Ser Tyr Glu Glu Asp Gln Arg Gln Gly Ala Glu Pro Arg
915 920 925 

AAA AAC TTT GTC AAG CCT AAT GAA ACC AAA ACT TAC TTT TGG AAA GTG 2832
Lys Asn Phe Val Lys Pro Asn Glu Thr Lys Thr Tyr Phe Trp Lys Val
930 935 940 

CAA CAT CAT ATG GCA CCC ACT AAA GAT GAG TTT GAC TGC AAA GCC TGG 2880
Gln His His Met Ala Pro Thr Lys Asp Glu Phe Asp Cys Lys Ala Trp
945 950 955 960 

GCT TAT TTC TCC GAT GTC GAC CTG GAA AAA GAT GTG CAC TCA GGC CTG 2928
Ala Tyr Phe Ser Asp Val Asp Leu Glu Lys Asp Val His Ser Gly Leu
965 970 975 

ATT GGA CCC CTT CTG GTC TGC CAC ACC AAC ACA CTG AAC CCT GCT CAT 2976
Ile Gly Pro Leu Leu Val Cys His Thr Asn Thr Leu Asn Pro Ala His
980 985 990 

GGG AGA CAA GTG ACA GTA CAG GAA TTT GCT CTG TTT TTC ACC ATC TTC 3024
Gly Arg Gln Val Thr Val Gln Glu Phe Ala Leu Phe Phe Thr Ile Phe
995 1000 1005 

GAT GAG ACC AAA AGC TGG TAC TTC ACT GAA AAT ATG GAA AGA AAC TGC 3072
Asp Glu Thr Lys Ser Trp Tyr Phe Thr Glu Asn Met Glu Arg Asn Cys
1010 1015 1020 

AGG GCT CCC TGC AAT ATC CAG ATG GAA GAT CCC ACT TTT AAA GAG AAT 3120
Arg Ala Pro Cys Asn Ile Gln Met Glu Asp Pro Thr Phe Lys Glu Asn
1025 1030 1035 1040 

TAT CGC TTC CAT GCA ATC AAT GGC TAC ATA ATG GAT ACA CTA CCT GGC 3168
Tyr Arg Phe His Ala Ile Asn Gly Tyr Ile Met Asp Thr Leu Pro Gly
1045 1050 1055 

TTA GTA ATG GCT CAG GAT CAA AGG ATT CGA TGG TAT CTG CTC AGC ATG 3216 
Leu Val Met Ala Gln Asp Gln Arg Ile Arg Trp Tyr Leu Leu Ser Met
1060 1065 1070 

GGC AGC AAT GAA AAC ATC CAT TCT ATT CAT TTC TCC GGA CAT GTG TTC 3264
Gly Ser Asn Glu Asn Ile His Ser Ile His Phe Ser Gly His Val Phe
1075 1080 1085 

ACT GTA CGA AAA AAA GAG GAG TAT AAA ATG GCA CTG TAC AAT CTC TAT 3312
Thr Val Arg Lys Lys Glu Glu Tyr Lys Met Ala Leu Tyr Asn Leu Tyr
1090 1095 1100 

CCC GGA GTT TTC GAG ACA GTG GAA ATG TTA CCA TCC AAA GCT GGA ATT 3360
Pro Gly Val Phe Glu Thr Val Glu Met Leu Pro Ser Lys Ala Gly Ile
1105 1110 1115 1120 

TGG CGG GTG GAA TGC CTT ATT GGC GAG CAT CTA CAT GCT GGG ATG AGC 3408
Trp Arg Val Glu Cys Leu Ile Gly Glu His Leu His Ala Gly Met Ser
1125 1130 1135 

ACA CTT TTT CTG GTG TAC TCC AAT AAG TGT CAG ACT CCC CTG GGA ATG 3456
Thr Leu Phe Leu Val Tyr Ser Asn Lys Cys Gln Thr Pro Leu Gly Met
1140 1145 1150 

GCT TCT GGA CAC ATT AGA GAT TTT CAG ATT ACA GCT TCA GGA CAA TAT 3504
Ala Ser Gly His Ile Arg Asp Phe Gln Ile Thr Ala Ser Gly Gln Tyr
1155 1160 1165 

GGA CAG TGG GCC CCA AAG CTG GCC AGA CTT CAT TAT TCC GGA TCA ATC 3552
Gly Gln Trp Ala Pro Lys Leu Ala Arg Leu His Tyr Ser Gly Ser Ile
1170 1175 1180 

AAT GCC TGG AGC ACC AAG GAG CCC TTT TCT TGG ATC AAA GTT GAC CTG 3600
Asn Ala Trp Ser Thr Lys Glu Pro Phe Ser Trp Ile Lys Val Asp Leu
1185 1190 1195 1200 

TTG GCA CCA ATG ATT ATT CAC GGC ATC AAG ACC CAG GGT GCC CGT CAG 3648
Leu Ala Pro Met Ile Ile His Gly Ile Lys Thr Gln Gly Ala Arg Gln
1205 1210 1215 

AAG TTC TCC AGC CTC TAC ATC TCT CAA TTT ATC ATC ATG TAT AGT CTC 3696 
Lys Phe Ser Ser Leu Tyr Ile Ser Gln Phe Ile Ile Met Tyr Ser Leu
1220 1225 1230 

GAT GGG AAG AAG TGG CAG ACT TAT CGA GGA AAT TCC ACT GGA ACC CTC 3744
Asp Gly Lys Lys Trp Gln Thr Tyr Arg Gly Asn Ser Thr Gly Thr Leu
1235 1240 1245 

ATG GTC TTC TTT GGC AAT GTG GAT TCA TCT GGG ATA AAA CAC AAT ATT 3792
Met Val Phe Phe Gly Asn Val Asp Ser Ser Gly Ile Lys His Asn Ile
1250 1255 1260 

TTC AAC CCT CCA ATT ATT GCT CGA TAC ATC CGT TTG CAC CCA ACT CAT 3840
Phe Asn Pro Pro Ile Ile Ala Arg Tyr Ile Arg Leu His Pro Thr His
1265 1270 1275 1280 

TAT AGC ATT CGC AGC ACT CTT CGC ATG GAG TTG ATG GGC TGT GAT TTA 3888
Tyr Ser Ile Arg Ser Thr Leu Arg Met Glu Leu Met Gly Cys Asp Leu
1285 1290 1295 

AAT AGT TGC AGC ATG CCA TTG GGA ATG GAG AGT AAA GCA ATA TCA GAT 3936
Asn Ser Cys Ser Met Pro Leu Gly Met Glu Ser Lys Ala Ile Ser Asp
1300 1305 1310 

GCA CAG ATT ACT GCT TCA TCC TAC TTT ACC AAT ATG TTT GCC ACC TGG 3984
Ala Gln Ile Thr Ala Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr Trp
1315 1320 1325 

TCT CCT TCA AAA GCT CGA CTA CAC CTA CAA GGG AGG AGT AAT GCC TGG 4032
Ser Pro Ser Lys Ala Arg Leu His Leu Gln Gly Arg Ser Asn Ala Trp
1330 1335 1340 

AGA CCT CAA GTT AAC AAT CCA AAA GAG TGG CTG CAA GTG GAC TTC CAG 4080
Arg Pro Gln Val Asn Asn Pro Lys Glu Trp Leu Gln Val Asp Phe Gln
1345 1350 1355 1360

AAG ACA ATG AAA GTC ACA GGA GTA ACT ACT CAG GGA GTA AAA TCT CTG 4128
Lys Thr Met Lys Val Thr Gly Val Thr Thr Gln Gly Val Lys Ser Leu
1365 1370 1375 

CTT ACC TCT ATG TAC GTG AAG GAG TTC CTC ATA TCG TCG TCG CAA GAT 4176 
Leu Thr Ser Met Tyr Val Lys Glu Phe Leu Ile Ser Ser Ser Gln Asp
1380 1385 1390 

GGC CAT CAG TGG ACT CTC TTT TTT CAA AAT GGC AAA GTA AAA GTT TTC 4224
Gly His Gln Trp Thr Leu Phe Phe Gln Asn Gly Lys Val Lys Val Phe
1395 1400 1405 

CAG GGA AAT CAA GAC TCC TTC ACA CCT GTC GTG AAC TCT CTA GAC CCA 4272
Gln Gly Asn Gln Asp Ser Phe Thr Pro Val Val Asn Ser Leu Asp Pro
1410 1415 1420 

CCG TTA CTC ACT CGC TAC CTT CGA ATT CAC CCC CAG AGT TGG GTG CAC 4320
Pro Leu Leu Thr Arg Tyr Leu Arg Ile His Pro Gln Ser Trp Val His
1425 1430 1435 1440 

CAG ATT GCC CTG AGG ATG GAG GTT CTG GGC TGC GAG GCA CAG GAC CTC 4368
Gln Ile Ala Leu Arg Met Glu Val Leu Gly Cys Glu Ala Gln Asp Leu
1445 1450 1455 

TAC TGA 4374

Tyr 
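
The pairing of codon triplets with three-letter residue codes in the listing above can be checked mechanically. The following is a minimal sketch, assuming the standard genetic code; the function names `translate` and `cds_codons` are hypothetical helpers introduced here for illustration and are not part of the listing.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering
# (assumption: the listing uses the standard code, with '*' for stop).
BASES = "TCAG"
AMINO_1 = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_1)
}

# One-letter to three-letter residue codes, as used in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(dna: str) -> list[str]:
    """Translate a DNA string (whitespace ignored) into three-letter codes.

    Translation stops at the first stop codon; a trailing partial codon is
    ignored. Non-ACGT characters raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(THREE_LETTER[aa])
    return residues


def cds_codons(start: int, end: int) -> int:
    """Number of codons in a 1-based, inclusive CDS location start..end."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length // 3


if __name__ == "__main__":
    first_line = "ATG GAA ATA GAG CTC TCC ACC TGC TTC TTT CTG TGC CTT TTG CGA TTC"
    print(" ".join(translate(first_line)))
    print(cds_codons(1, 4374))
```

Run on the first codon line of SEQ ID NO: 1, the sketch reproduces the residue line printed beneath it, which is a quick way to spot transcription errors in either line.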

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9164 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE:

(A) NAME/KEY: CDS 

(B) LOCATION: 1006..5376

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

GTCGACGGTA TCGATAAGCT TGATATCGAA TTCCTGCAGC CCGGGGGATC CACTAGTACT 60

CGAGACCTAG GAGTTAATTT TTAAAAAGCA GTCAAAAGTC CAAGTGGCCC TTGCGAGCAT 120

TTACTCTCTC TGTTTGCTCT GGTTAATAAT CTCAGGAGCA CAAACATTCC TTACTAGTCC 180

TAGAAGTTAA TTTTTAAAAA GCAGTCAAAA GTCCAAGTGG CCCTTGCGAG CATTTACTCT 240

CTCTGTTTGC TCTGGTTAAT AATCTCAGGA GCACAAACAT TCCTTACTAG TTCTAGAGCG 300

GCCGCCAGTG TGCTGGAATT CGGCTTTTTT AGGGCTGGAA GCTACCTTTG ACATCATTTC 360

CTCTGCGAAT GCATGTATAA TTTCTACAGA ACCTATTAGA AAGGATCACC CAGCCTCTGC 420

TTTTGTACAA CTTTCCCTTA AAAAACTGCC AATTCCACTG CTGTTTGGCC CAATAGTGAG 480 

AACTTTTTCC TGCTGCCTCT TGGTGCTTTT GCCTATGGCC CCTATTCTGC CTGCTGAAGA 540

CACTCTTGCC AGCATGGACT TAAACCCCTC CAGCTCTGAC AATCCTCTTT CTCTTTTGTT 600 

TTACATGAAG GGTCTGGCAG CCAAAGCAAT CACTCAAAGT TCAAACCTTA TCATTTTTTG 660 

CTTTGTTCCT CTTGGCCTTG GTTTTGTACA TCAGCTTTGA AAATACCATC CCAGGGTTAA 720

TGCTGGGGTT AATTTATAAC TAAGAGTGCT CTAGTTTTGC AATACAGGAC ATGCTATAAA 780 

AATGGAAAGA TGTTGCTTTC TGAGAGATCT CGAGGAAGCT AACAACAAAG AACAACAAAC 840

AACAATCAGG TAAGTATCCT TTTTACAGCA CAACTTAATG AGACAGATAG AAACTGGTCT 900 

TGTAGAAACA GAGTAGTCGC CTGCTTTTCT GCCAGGTGCT GACTTCTCTC CCCTTCTCTT 960 

TTTTCCTTTT CTCAGGATAA CAAGAACGAA ACAATAACAG CCACC ATG GAA ATA 1014 

Met Glu Ile
1 

GAG CTC TCC ACC TGC TTC TTT CTG TGC CTT TTG CGA TTC TGC TTT AGT 1062 
Glu Leu Ser Thr Cys Phe Phe Leu Cys Leu Leu Arg Phe Cys Phe Ser 
5 10 15 

GCC ACC AGA AGA TAC TAC CTG GGT GCA GTG GAA CTG TCA TGG GAC TAT 1110 
Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser Trp Asp Tyr 
20 25 30 35 

ATG CAA AGT GAT CTC GGT GAG CTG CCT GTG GAC GCA AGA TTT CCT CCT 1158
Met Gln Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg Phe Pro Pro
40 45 50 

AGA GTG CCA AAA TCT TTT CCA TTC AAC ACC TCA GTC GTG TAC AAA AAG 1206
Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val Val Tyr Lys Lys
55 60 65 

ACT CTG TTT GTA GAA TTC ACG GTT CAC CTT TTC AAC ATC GCT AAG CCA 1254
Thr Leu Phe Val Glu Phe Thr Val His Leu Phe Asn Ile Ala Lys Pro
70 75 80 

AGG CCA CCC TGG ATG GGT CTG CTA GGT CCT ACC ATC CAG GCT GAG GTT 1302
Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr Ile Gln Ala Glu Val
85 90 95 

TAT GAT ACA GTG GTC ATT ACA CTT AAG AAC ATG GCT TCC CAT CCT GTC 1350
Tyr Asp Thr Val Val Ile Thr Leu Lys Asn Met Ala Ser His Pro Val
100 105 110 115 

AGT CTT CAT GCT GTT GGT GTA TCC TAC TGG AAA GCT TCT GAG GGA GCT 1398
Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser Glu Gly Ala
120 125 130 

GAA TAT GAT GAT CAG ACC AGT CAA AGG GAG AAA GAA GAT GAT AAA GTC 1446
Glu Tyr Asp Asp Gln Thr Ser Gln Arg Glu Lys Glu Asp Asp Lys Val
135 140 145 

TTC CCT GGT GGA AGC CAT ACA TAT GTC TGG CAG GTC CTG AAA GAG AAT 1494
Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gln Val Leu Lys Glu Asn
150 155 160 

GGT CCA ATG GCC TCT GAC CCA CTG TGC CTT ACC TAC TCA TAT CTT TCT 1542 
Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser Tyr Leu Ser 
165 170 175 

CAT GTG GAC CTG GTA AAA GAC TTG AAT TCA GGC CTC ATT GGA GCC CTA 1590 
His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu Ile Gly Ala Leu
180 185 190 195 

CTA GTA TGT AGA GAA GGG AGT CTG GCC AAG GAA AAG ACA CAG ACC TTG 1638
Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr Gln Thr Leu
200 205 210 

CAC AAA TTT ATA CTA CTT TTT GCT GTA TTT GAT GAA GGG AAA AGT TGG 1686
His Lys Phe Ile Leu Leu Phe Ala Val Phe Asp Glu Gly Lys Ser Trp
215 220 225 

CAC TCA GAA ACA AAG AAC TCC TTG ATG CAG GAT AGG GAT GCT GCA TCT 1734 
His Ser Glu Thr Lys Asn Ser Leu Met Gln Asp Arg Asp Ala Ala Ser
230 235 240 

GCT CGG GCC TGG CCT AAA ATG CAC ACA GTC AAT GGT TAT GTA AAC AGG 1782 
Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr Val Asn Arg 
245 250 255 

TCT CTG CCA GGT CTG ATT GGA TGC CAC AGG AAA TCA GTC TAT TGG CAT 1830
Ser Leu Pro Gly Leu Ile Gly Cys His Arg Lys Ser Val Tyr Trp His
260 265 270 275 

GTG ATT GGA ATG GGC ACC ACT CCT GAA GTG CAC TCA ATA TTC CTC GAA 1878
Val Ile Gly Met Gly Thr Thr Pro Glu Val His Ser Ile Phe Leu Glu
280 285 290 

GGT CAC ACA TTT CTT GTG AGG AAC CAT CGC CAG GCG TCC TTG GAA ATC 1926 
Gly His Thr Phe Leu Val Arg Asn His Arg Gln Ala Ser Leu Glu Ile
295 300 305 

TCG CCA ATA ACT TTC CTT ACT GCT CAA ACA CTC TTG ATG GAC CTT GGA 1974 
Ser Pro Ile Thr Phe Leu Thr Ala Gln Thr Leu Leu Met Asp Leu Gly
310 315 320 

CAG TTT CTA CTG TTT TGT CAT ATC TCT TCC CAC CAA CAT GAT GGC ATG 2022
Gln Phe Leu Leu Phe Cys His Ile Ser Ser His Gln His Asp Gly Met
325 330 335 

GAA GCT TAT GTC AAA GTA GAC AGC TGT CCA GAG GAA CCC CAA CTA CGA 2070
Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro Gln Leu Arg
340 345 350 355 

ATG AAA AAT AAT GAA GAA GCG GAA GAC TAT GAT GAT GAT CTT ACT GAT 2118 
Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp Leu Thr Asp
360 365 370

TCT GAA ATG GAT GTG GTC AGG TTT GAT GAT GAC AAC TCT CCT TCC TTT 2166 
Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser Pro Ser Phe 
375 380 385 

ATC CAA ATT CGC TCA GTT GCC AAG AAG CAT CCT AAA ACT TGG GTA CAT 2214
Ile Gln Ile Arg Ser Val Ala Lys Lys His Pro Lys Thr Trp Val His
390 395 400 

TAC ATT GCT GCT GAA GAG GAG GAC TGG GAC TAT GCT CCC TTA GTC CTC 2262
Tyr Ile Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro Leu Val Leu
405 410 415 

GCC CCC GAT GAC AGA AGT TAT AAA AGT CAA TAT TTG AAC AAT GGC CCT 2310
Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gln Tyr Leu Asn Asn Gly Pro
420 425 430 435 

CAG CGG ATT GGT AGG AAG TAC AAA AAA GTC CGA TTT ATG GCA TAC ACA 2358
Gln Arg Ile Gly Arg Lys Tyr Lys Lys Val Arg Phe Met Ala Tyr Thr
440 445 450 

GAT GAA ACC TTT AAG ACT CGT GAA GCT ATT CAG CAT GAA TCA GGA ATC 2406
Asp Glu Thr Phe Lys Thr Arg Glu Ala Ile Gln His Glu Ser Gly Ile
455 460 465 

TTG GGA CCT TTA CTT TAT GGG GAA GTT GGA GAC ACA CTG TTG ATT ATA 2454
Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu Leu Ile Ile
470 475 480 

TTT AAG AAT CAA GCA AGC AGA CCA TAT AAC ATC TAC CCT CAC GGA ATC 2502
Phe Lys Asn Gln Ala Ser Arg Pro Tyr Asn Ile Tyr Pro His Gly Ile
485 490 495 

ACT GAT GTC CGT CCT TTG TAT TCA AGG AGA TTA CCA AAA GGT GTA AAA 2550
Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys Gly Val Lys
500 505 510 515 

CAT TTG AAG GAT TTT CCA ATT CTG CCA GGA GAA ATA TTC AAA TAT AAA 2598
His Leu Lys Asp Phe Pro Ile Leu Pro Gly Glu Ile Phe Lys Tyr Lys
520 525 530 

TGG ACA GTG ACT GTA GAA GAT GGG CCA ACT AAA TCA GAT CCT CGG TGC 2646
Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp Pro Arg Cys
535 540 545 

CTG ACC CGC TAT TAC TCT AGT TTC GTT AAT ATG GAG AGA GAT CTA GCT 2694
Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg Asp Leu Ala
550 555 560 

TCA GGA CTC ATT GGC CCT CTC CTC ATC TGC TAC AAA GAA TCT GTA GAT 2742
Ser Gly Leu Ile Gly Pro Leu Leu Ile Cys Tyr Lys Glu Ser Val Asp
565 570 575 

CAA AGA GGA AAC CAG ATA ATG TCA GAC AAG AGG AAT GTC ATC CTG TTT 2790
Gln Arg Gly Asn Gln Ile Met Ser Asp Lys Arg Asn Val Ile Leu Phe
580 585 590 595

TCT GTA TTT GAT GAG AAC CGA AGC TGG TAC CTG ACA GAG AAT ATA CAA 2838
Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu Asn Ile Gln
600 605 610 

CGC TTT CTC CCC AAT CCA GCT GGA GTG CAG CTT GAG GAT CCA GAG TTC 2886 
Arg Phe Leu Pro Asn Pro Ala Gly Val Gln Leu Glu Asp Pro Glu Phe
615 620 625 

CAA GCC TCC AAC ATC ATG CAC AGC ATC AAT GGC TAT GTT TTT GAT AGT 2934
Gln Ala Ser Asn Ile Met His Ser Ile Asn Gly Tyr Val Phe Asp Ser
630 635 640 

TTG CAG TTG TCA GTT TGT TTG CAT GAG GTG GCA TAC TGG TAC ATT CTA 2982
Leu Gln Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp Tyr Ile Leu
645 650 655 

AGC ATT GGA GCA CAG ACT GAC TTC CTT TCT GTC TTC TTC TCT GGA TAT 3030
Ser Ile Gly Ala Gln Thr Asp Phe Leu Ser Val Phe Phe Ser Gly Tyr
660 665 670 675 

ACC TTC AAA CAC AAA ATG GTC TAT GAA GAC ACA CTC ACC CTA TTC CCA 3078
Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr Leu Phe Pro
680 685 690 

TTC TCA GGA GAA ACT GTC TTC ATG TCG ATG GAA AAC CCA GGT CTA TGG 3126 
Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro Gly Leu Trp 
695 700 705 

ATT CTG GGG TGC CAC AAC TCA GAC TTT CGG AAC AGA GGC ATG ACC GCC 3174 
Ile Leu Gly Cys His Asn Ser Asp Phe Arg Asn Arg Gly Met Thr Ala
710 715 720 

TTA CTG AAG GTT TCT AGT TGT GAC AAG AAC ACT GGT GAT TAT TAC GAG 3222
Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp Tyr Tyr Glu
725 730 735 

GAC AGT TAT GAA GAT ATT TCA GCA TAC TTG CTG AGT AAA AAC AAT GCC 3270
Asp Ser Tyr Glu Asp Ile Ser Ala Tyr Leu Leu Ser Lys Asn Asn Ala
740 745 750 755 

ATT GAA CCA AGA AGC TTC TCC CAG AAC CCA CCA GTC TTG AAA CGC CAT 3318
Ile Glu Pro Arg Ser Phe Ser Gln Asn Pro Pro Val Leu Lys Arg His
760 765 770 

CAA CGG GAA ATA ACT CGT ACT ACT CTT CAG TCA GAT CAA GAG GAA ATT 3366
Gln Arg Glu Ile Thr Arg Thr Thr Leu Gln Ser Asp Gln Glu Glu Ile
775 780 785 

GAC TAT GAT GAT ACC ATA TCA GTT GAA ATG AAG AAG GAA GAT TTT GAC 3414 
Asp Tyr Asp Asp Thr Ile Ser Val Glu Met Lys Lys Glu Asp Phe Asp
790 795 800 

ATT TAT GAT GAG GAT GAA AAT CAG AGC CCC CGC AGC TTT CAA AAG AAA 3462 
Ile Tyr Asp Glu Asp Glu Asn Gln Ser Pro Arg Ser Phe Gln Lys Lys
805 810 815

ACA CGA CAC TAT TTT ATT GCT GCA GTG GAG AGG CTC TGG GAT TAT GGG 3510
Thr Arg His Tyr Phe Ile Ala Ala Val Glu Arg Leu Trp Asp Tyr Gly
820 825 830 835 

ATG AGT AGC TCC CCA CAT GTT CTA AGA AAC AGG GCT CAG AGT GGC AGT 3558
Met Ser Ser Ser Pro His Val Leu Arg Asn Arg Ala Gln Ser Gly Ser
840 845 850 

GTC CCT CAG TTC AAG AAA GTT GTT TTC CAG GAA TTT ACT GAT GGC TCC 3606 
Val Pro Gln Phe Lys Lys Val Val Phe Gln Glu Phe Thr Asp Gly Ser
855 860 865 

TTT ACT CAG CCC TTA TAC CGT GGA GAA CTA AAT GAA CAT TTG GGA CTC 3654
Phe Thr Gln Pro Leu Tyr Arg Gly Glu Leu Asn Glu His Leu Gly Leu
870 875 880 

CTG GGG CCA TAT ATA AGA GCA GAA GTT GAA GAT AAT ATC ATG GTA ACT 3702
Leu Gly Pro Tyr Ile Arg Ala Glu Val Glu Asp Asn Ile Met Val Thr
885 890 895 

TTC AGA AAT CAG GCC TCT CGT CCC TAT TCC TTC TAT TCT AGC CTT ATT 3750
Phe Arg Asn Gln Ala Ser Arg Pro Tyr Ser Phe Tyr Ser Ser Leu Ile
900 905 910 915 

TCT TAT GAG GAA GAT CAG AGG CAA GGA GCA GAA CCT AGA AAA AAC TTT 3798
Ser Tyr Glu Glu Asp Gln Arg Gln Gly Ala Glu Pro Arg Lys Asn Phe
920 925 930 

GTC AAG CCT AAT GAA ACC AAA ACT TAC TTT TGG AAA GTG CAA CAT CAT 3846
Val Lys Pro Asn Glu Thr Lys Thr Tyr Phe Trp Lys Val Gln His His
935 940 945 

ATG GCA CCC ACT AAA GAT GAG TTT GAC TGC AAA GCC TGG GCT TAT TTC 3894
Met Ala Pro Thr Lys Asp Glu Phe Asp Cys Lys Ala Trp Ala Tyr Phe
950 955 960 

TCT GAT GTT GAC CTG GAA AAA GAT GTG CAC TCA GGC CTG ATT GGA CCC 3942
Ser Asp Val Asp Leu Glu Lys Asp Val His Ser Gly Leu Ile Gly Pro
965 970 975 

CTT CTG GTC TGC CAC ACT AAC ACA CTG AAC CCT GCT CAT GGG AGA CAA 3990
Leu Leu Val Cys His Thr Asn Thr Leu Asn Pro Ala His Gly Arg Gln
980 985 990 995 

GTG ACA GTA CAG GAA TTT GCT CTG TTT TTC ACC ATC TTT GAT GAG ACC 4038
Val Thr Val Gln Glu Phe Ala Leu Phe Phe Thr Ile Phe Asp Glu Thr
1000 1005 1010 

AAA AGC TGG TAC TTC ACT GAA AAT ATG GAA AGA AAC TGC AGG GCT CCC 4086
Lys Ser Trp Tyr Phe Thr Glu Asn Met Glu Arg Asn Cys Arg Ala Pro
1015 1020 1025 

TGC AAT ATC CAG ATG GAA GAT CCC ACT TTT AAA GAG AAT TAT CGC TTC 4134 
Cys Asn Ile Gln Met Glu Asp Pro Thr Phe Lys Glu Asn Tyr Arg Phe
1030 1035 1040 

CAT GCA ATC AAT GGC TAC ATA ATG GAT ACA CTA CCT GGC TTA GTA ATG 4182
His Ala Ile Asn Gly Tyr Ile Met Asp Thr Leu Pro Gly Leu Val Met
1045 1050 1055 

GCT CAG GAT CAA AGG ATT CGA TGG TAT CTG CTC AGC ATG GGC AGC AAT 4230
Ala Gln Asp Gln Arg Ile Arg Trp Tyr Leu Leu Ser Met Gly Ser Asn
1060 1065 1070 1075 

GAA AAC ATC CAT TCT ATT CAT TTC AGT GGA CAT GTG TTC ACT GTA CGA 4278 
Glu Asn Ile His Ser Ile His Phe Ser Gly His Val Phe Thr Val Arg
1080 1085 1090 

AAA AAA GAG GAG TAT AAA ATG GCA CTG TAC AAT CTC TAT CCA GGT GTT 4326
Lys Lys Glu Glu Tyr Lys Met Ala Leu Tyr Asn Leu Tyr Pro Gly Val
1095 1100 1105 

TTT GAG ACA GTG GAA ATG TTA CCA TCC AAA GCT GGA ATT TGG CGG GTG 4374
Phe Glu Thr Val Glu Met Leu Pro Ser Lys Ala Gly Ile Trp Arg Val
1110 1115 1120

GAA TGC CTT ATT GGC GAG CAT CTA CAT GCT GGG ATG AGC ACA CTT TTT 4422 
Glu Cys Leu Ile Gly Glu His Leu His Ala Gly Met Ser Thr Leu Phe
1125 1130 1135 

CTG GTG TAC AGC AAT AAG TGT CAG ACT CCC CTG GGA ATG GCT TCT GGA 4470
Leu Val Tyr Ser Asn Lys Cys Gln Thr Pro Leu Gly Met Ala Ser Gly
1140 1145 1150 1155 

CAC ATT AGA GAT TTT CAG ATT ACA GCT TCA GGA CAA TAT GGA CAG TGG 4518
His Ile Arg Asp Phe Gln Ile Thr Ala Ser Gly Gln Tyr Gly Gln Trp
1160 1165 1170 

GCC CCA AAG CTG GCC AGA CTT CAT TAT TCC GGA TCA ATC AAT GCC TGG 4566
Ala Pro Lys Leu Ala Arg Leu His Tyr Ser Gly Ser Ile Asn Ala Trp
1175 1180 1185 

AGC ACC AAG GAG CCC TTT TCT TGG ATC AAG GTG GAT CTG TTG GCA CCA 4614
Ser Thr Lys Glu Pro Phe Ser Trp Ile Lys Val Asp Leu Leu Ala Pro
1190 1195 1200 

ATG ATT ATT CAC GGC ATC AAG ACC CAG GGT GCC CGT CAG AAG TTC TCC 4662
Met Ile Ile His Gly Ile Lys Thr Gln Gly Ala Arg Gln Lys Phe Ser
1205 1210 1215 

AGC CTC TAC ATC TCT CAG TTT ATC ATC ATG TAT AGT CTT GAT GGG AAG 4710
Ser Leu Tyr Ile Ser Gln Phe Ile Ile Met Tyr Ser Leu Asp Gly Lys
1220 1225 1230 1235 

AAG TGG CAG ACT TAT CGA GGA AAT TCC ACT GGA ACC TTA ATG GTC TTC 4758
Lys Trp Gln Thr Tyr Arg Gly Asn Ser Thr Gly Thr Leu Met Val Phe
1240 1245 1250 

TTT GGC AAT GTG GAT TCA TCT GGG ATA AAA CAC AAT ATT TTT AAC CCT 4806
Phe Gly Asn Val Asp Ser Ser Gly Ile Lys His Asn Ile Phe Asn Pro
1255 1260 1265 

CCA ATT ATT GCT CGA TAC ATC CGT TTG CAC CCA ACT CAT TAT AGC ATT 4854
Pro Ile Ile Ala Arg Tyr Ile Arg Leu His Pro Thr His Tyr Ser Ile
1270 1275 1280

CGC AGC ACT CTT CGC ATG GAG TTG ATG GGC TGT GAT TTA AAT AGT TGC 4902
Arg Ser Thr Leu Arg Met Glu Leu Met Gly Cys Asp Leu Asn Ser Cys
1285 1290 1295 

AGC ATG CCA TTG GGA ATG GAG AGT AAA GCA ATA TCA GAT GCA CAG ATT 4950
Ser Met Pro Leu Gly Met Glu Ser Lys Ala Ile Ser Asp Ala Gln Ile
1300 1305 1310 1315 

ACT GCT TCA TCC TAC TTT ACC AAT ATG TTT GCC ACC TGG TCT CCT TCA 4998
Thr Ala Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr Trp Ser Pro Ser
1320 1325 1330 

AAA GCT CGA CTT CAC CTC CAA GGG AGG AGT AAT GCC TGG AGA CCT CAG 5046 
Lys Ala Arg Leu His Leu Gln Gly Arg Ser Asn Ala Trp Arg Pro Gln
1335 1340 1345 

GTG AAT AAT CCA AAA GAG TGG CTG CAA GTG GAC TTC CAG AAG ACA ATG 5094
Val Asn Asn Pro Lys Glu Trp Leu Gln Val Asp Phe Gln Lys Thr Met
1350 1355 1360 

AAA GTC ACA GGA GTA ACT ACT CAG GGA GTA AAA TCT CTG CTT ACC AGC 5142 
Lys Val Thr Gly Val Thr Thr Gln Gly Val Lys Ser Leu Leu Thr Ser
1365 1370 1375 

ATG TAT GTG AAG GAG TTC CTC ATC TCC AGC AGT CAA GAT GGC CAT CAG 5190 
Met Tyr Val Lys Glu Phe Leu Ile Ser Ser Ser Gln Asp Gly His Gln
1380 1385 1390 1395 

TGG ACT CTC TTT TTT CAG AAT GGC AAA GTA AAG GTT TTT CAG GGA AAT 5238
Trp Thr Leu Phe Phe Gln Asn Gly Lys Val Lys Val Phe Gln Gly Asn
1400 1405 1410 

CAA GAC TCC TTC ACA CCT GTG GTG AAC TCT CTA GAC CCA CCG TTA CTG 5286
Gln Asp Ser Phe Thr Pro Val Val Asn Ser Leu Asp Pro Pro Leu Leu
1415 1420 1425 

ACT CGC TAC CTT CGA ATT CAC CCC CAG AGT TGG GTG CAC CAG ATT GCC 5334 
Thr Arg Tyr Leu Arg Ile His Pro Gln Ser Trp Val His Gln Ile Ala
1430 1435 1440 

CTG AGG ATG GAG GTT CTG GGC TGC GAG GCA CAG GAC CTC TAC 5376
Leu Arg Met Glu Val Leu Gly Cys Glu Ala Gln Asp Leu Tyr
1445 1450 1455 

TGAGGGTGGC CACTGCAGCA CCTGCCACTG CCGTCACCTC TCCCTCCTCA GCTCCAGGGC 5436

AGTGTCCCTC CCTGGCTTGC CTTCTACCTT TGTGCTAAAT CCTAGCAGAC ACTGCCTTGA 5496

AGCCTCCTGA ATTAACTATC ATCAGTCCTG CATTTCTTTG GTGGGGGGCC AGGAGGGTGC 5556 

ATCCAATTTA ACTTAACTCT TACCTATTTT CTGCAGCTGC TCCCAGATTA CTCCTTCCTT 5616 

CCAATATAAC TAGGCAAAAA GAAGTGAGGA GAAACCTGCA TGAAAGCATT CTTCCCTGAA 5676 

AAGTTAGGCC TCTCAGAGTC ACCACTTCCT CTGTTGTAGA AAAACTATGT GATGAAACTT 5736

TGAAAAAGAT ATTTATGATG TTAACTTGTT TATTGCAGCT TATAATGGTT ACAAATAAAG 5796

CAATAGCATC ACAAATTTCA CAAATAAAGC ATTTTTTTCA CTGCATTCTA GTTGTGGTTT 5856 

GTCCAAACTC ATCAATGTAT CTTATCATGT CTGGATCCCC GGGTGGCATC CCTGTGACCC 5916 

CTCCCCAGTG CCTCTCCTGG CCCTGGAAGT TGCCACTCCA GTGCCCACCA GCCTTGTCCT 5976 

AATAAAATTA AGTTGCATCA TTTTGTCTGA CTAGGTGTCC TTCTATAATA TTATGGGGTG 6036

GAGGGGGGTG GTATGGAGCA AGGGGCAAGT TGGGAAGACA ACCTGTAGGG CCTGCGGGGT 6096

CTATTCGGGA ACCAAGCTGG AGTGCAGTGG CACAATCTTG GCTCACTGCA ATCTCCGCCT 6156 

CCTGGGTTCA AGCGATTCTC CTGCCTCAGC CTCCCGAGTT GTTGGGATTC CAGGCATGCA 6216 

TGACCAGGCT CAGCTAATTT TTGTTTTTTT GGTAGAGACG GGGTTTCACC ATATTGGCCA 6276

GGCTGGTCTC CAACTCCTAA TCTCAGGTGA TCTACCCACC TTGGCCTCCC AAATTGCTGG 6336 

GATTACAGGC GTGAACCACT GCTCCCTTCC CTGTCCTTCT GATTTTAAAA TAACTATACC 6396

AGCAGGAGGA CGTCCAGACA CAGCATAGGC TACCTGCCAT GCCCAACCGG TGGGACATTT 6456

GAGTTGCTTG CTTGGCACTG TCCTCTCATG CGTTGGGTCC ACTCAGTAGA TGCCTGTTGA 6516

ATTCGTAATC ATGGTCATAG CTGTTTCCTG TGTGAAATTG TTATCCGCTC ACAATTCCAC 6576 

ACAACATACG AGCCGGAAGC ATAAAGTGTA AAGCCTGGGG TGCCTAATGA GTGAGCTAAC 6636 

TCACATTAAT TGCGTTGCGC TCACTGCCCG CTTTCCAGTC GGGAAACCTG TCGTGCCAGC 6696

TGCATTAATG AATCGGCCAA CGCGCGGGGA GAGGCGGTTT GCGTATTGGG CGCTCTTCCG 6756

CTTCCTCGCT CACTGACTCG CTGCGCTCGG TCGTTCGGCT GCGGCGAGCG GTATCAGCTC 6816

ACTCAAAGGC GGTAATACGG TTATCCACAG AATCAGGGGA TAACGCAGGA AAGAACATGT 6876

GAGCAAAAGG CCAGCAAAAG GCCAGGAACC GTAAAAAGGC CGCGTTGCTG GCGTTTTTCC 6936

ATAGGCTCCG CCCCCCTGAC GAGCATCACA AAAATCGACG CTCAAGTCAG AGGTGGCGAA 6996 

ACCCGACAGG ACTATAAAGA TACCAGGCGT TTCCCCCTGG AAGCTCCCTC GTGCGCTCTC 7056

CTGTTCCGAC CCTGCCGCTT ACCGGATACC TGTCCGCCTT TCTCCCTTCG GGAAGCGTGG 7116 

CGCTTTCTCA TAGCTCACGC TGTAGGTATC TCAGTTCGGT GTAGGTCGTT CGCTCCAAGC 7176 

TGGGCTGTGT GCACGAACCC CCCGTTCAGC CCGACCGCTG CGCCTTATCC GGTAACTATC 7236

GTCTTGAGTC CAACCCGGTA AGACACGACT TATCGCCACT GGCAGCAGCC ACTGGTAACA 7296 

GGATTAGCAG AGCGAGGTAT GTAGGCGGTG CTACAGAGTT CTTGAAGTGG TGGCCTAACT 7356

ACGGCTACAC TAGAAGGACA GTATTTGGTA TCTGCGCTCT GCTGAAGCCA GTTACCTTCG 7416

GAAAAAGAGT TGGTAGCTCT TGATCCGGCA AACAAACCAC CGCTGGTAGC GGTGGTTTTT 7476

TTGTTTGCAA GCAGCAGATT ACGCGCAGAA AAAAAGGATC TCAAGAAGAT CCTTTGATCT 7536

TTTCTACGGG GTCTGACGCT CAGTGGAACG AAAACTCACG TTAAGGGATT TTGGTCATGA 7596

GATTATCAAA AAGGATCTTC ACCTAGATCC TTTTAAATTA AAAATGAAGT TTTAAATCAA 7656 

TCTAAAGTAT ATATGAGTAA ACTTGGTCTG ACAGTTACCA ATGCTTAATC AGTGAGGCAC 7716

CTATCTCAGC GATCTGTCTA TTTCGTTCAT CCATAGTTGC CTGACTCCCC GTCGTGTAGA 7776

TAACTACGAT ACGGGAGGGC TTACCATCTG GCCCCAGTGC TGCAATGATA CCGCGAGACC 7836

CACGCTCACC GGCTCCAGAT TTATCAGCAA TAAACCAGCC AGCCGGAAGG GCCGAGCGCA 7896

GAAGTGGTCC TGCAACTTTA TCCGCCTCCA TCCAGTCTAT TAATTGTTGC CGGGAAGCTA 7956

GAGTAAGTAG TTCGCCAGTT AATAGTTTGC GCAACGTTGT TGCCATTGCT ACAGGCATCG 8016

TGGTGTCACG CTCGTCGTTT GGTATGGCTT CATTCAGCTC CGGTTCCCAA CGATCAAGGC 8076 

GAGTTACATG ATCCCCCATG TTGTGCAAAA AAGCGGTTAG CTCCTTCGGT CCTCCGATCG 8136 

TTGTCAGAAG TAAGTTGGCC GCAGTGTTAT CACTCATGGT TATGGCAGCA CTGCATAATT 8196 

CTCTTACTGT CATGCCATCC GTAAGATGCT TTTCTGTGAC TGGTGAGTAC TCAACCAAGT 8256

CATTCTGAGA ATAGTGTATG CGGCGACCGA GTTGCTCTTG CCCGGCGTCA ATACGGGATA 8316 

ATACCGCGCC ACATAGCAGA ACTTTAAAAG TGCTCATCAT TGGAAAACGT TCTTCGGGGC 8376

GAAAACTCTC AAGGATCTTA CCGCTGTTGA GATCCAGTTC GATGTAACCC ACTCGTGCAC 8436

CCAACTGATC TTCAGCATCT TTTACTTTCA CCAGCGTTTC TGGGTGAGCA AAAACAGGAA 8496

GGCAAAATGC CGCAAAAAAG GGAATAAGGG CGACACGGAA ATGTTGAATA CTCATACTCT 8556 

TCCTTTTTCA ATATTATTGA AGCATTTATC AGGGTTATTG TCTCATGAGC GGATACATAT 8616 

TTGAATGTAT TTAGAAAAAT AAACAAATAG GGGTTCCGCG CACATTTCCC CGAAAAGTGC 8676

CACCTGACGT CTAAGAAACC ATTATTATCA TGACATTAAC CTATAAAAAT AGGCGTATCA 8736 

CGAGGCCCTT TCGTCTCGCG CGTTTCGGTG ATGACGGTGA AAACCTCTGA CACATGCAGC 8796

TCCCGGAGAC GGTCACAGCT TGTCTGTAAG CGGATGCCGG GAGCAGACAA GCCCGTCAGG 8856

GCGCGTCAGC GGGTGTTGGC GGGTGTCGGG GCTGGCTTAA CTATGCGGCA TCAGAGCAGA 8916

TTGTACTGAG AGTGCACCAT ATGCGGTGTG AAATACCGCA CAGATGCGTA AGGAGAAAAT 8976 

ACCGCATCAG GCGCCATTCG CCATTCAGGC TGCGCAACTG TTGGGAAGGG CGATCGGTGC 9036 

GGGCCTCTTC GCTATTACGC CAGCTGGCGA AAGGGGGATG TGCTGCAAGG CGATTAAGTT 9096 

GGGTAACGCC AGGGTTTTCC CAGTCACGAC GTTGTAAAAC GACGGCCAGT GCCAAGCTTG 9156

GGCTGCAG 9164

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12022 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1006..3294

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 6153..8234

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

GTCGACGGTA TCGATAAGCT TGATATCGAA TTCCTGCAGC CCGGGGGATC CACTAGTACT 60

CGAGACCTAG GAGTTAATTT TTAAAAAGCA GTCAAAAGTC CAAGTGGCCC TTGCGAGCAT 120

TTACTCTCTC TGTTTGCTCT GGTTAATAAT CTCAGGAGCA CAAACATTCC TTACTAGTCC 180

TAGAAGTTAA TTTTTAAAAA GCAGTCAAAA GTCCAAGTGG CCCTTGCGAG CATTTACTCT 240

CTCTGTTTGC TCTGGTTAAT AATCTCAGGA GCACAAACAT TCCTTACTAG TTCTAGAGCG 300 

GCCGCCAGTG TGCTGGAATT CGGCTTTTTT AGGGCTGGAA GCTACCTTTG ACATCATTTC 360 

CTCTGCGAAT GCATGTATAA TTTCTACAGA ACCTATTAGA AAGGATCACC CAGCCTCTGC 420

TTTTGTACAA CTTTCCCTTA AAAAACTGCC AATTCCACTG CTGTTTGGCC CAATAGTGAG 480 

AACTTTTTCC TGCTGCCTCT TGGTGCTTTT GCCTATGGCC CCTATTCTGC CTGCTGAAGA 540

CACTCTTGCC AGCATGGACT TAAACCCCTC CAGCTCTGAC AATCCTCTTT CTCTTTTGTT 600 

TTACATGAAG GGTCTGGCAG CCAAAGCAAT CACTCAAAGT TCAAACCTTA TCATTTTTTG 660

CTTTGTTCCT CTTGGCCTTG GTTTTGTACA TCAGCTTTGA AAATACCATC CCAGGGTTAA 720

TGCTGGGGTT AATTTATAAC TAAGAGTGCT CTAGTTTTGC AATACAGGAC ATGCTATAAA 780

AATGGAAAGA TGTTGCTTTC TGAGAGATCT CGAGGAAGCT AACAACAAAG AACAACAAAC 840

AACAATCAGG TAAGTATCCT TTTTACAGCA CAACTTAATG AGACAGATAG AAACTGGTCT 900 

TGTAGAAACA GAGTAGTCGC CTGCTTTTCT GCCAGGTGCT GACTTCTCTC CCCTTCTCTT 960 

TTTTCCTTTT CTCAGGATAA CAAGAACGAA ACAATAACAG CCACC ATG GAA ATA 1014 

Met Glu Ile

GAG CTC TCC ACC TGC TTC TTT CTG TGC CTT TTG CGA TTC TGC TTT AGT 1062
Glu Leu Ser Thr Cys Phe Phe Leu Cys Leu Leu Arg Phe Cys Phe Ser 
5 10 15 

GCC ACC AGA AGA TAC TAC CTG GGT GCA GTG GAA CTG TCA TGG GAC TAT 1110 
Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser Trp Asp Tyr 
20 25 30 35 

ATG CAA AGT GAT CTC GGT GAG CTG CCT GTG GAC GCA AGA TTT CCT CCT 1158
Met Gln Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg Phe Pro Pro
40 45 50 

AGA GTG CCA AAA TCT TTT CCA TTC AAC ACC TCA GTC GTG TAC AAA AAG 1206
Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val Val Tyr Lys Lys
55 60 65 

ACT CTG TTT GTA GAA TTC ACG GTT CAC CTT TTC AAC ATC GCT AAG CCA 1254
Thr Leu Phe Val Glu Phe Thr Val His Leu Phe Asn Ile Ala Lys Pro
70 75 80 

AGG CCA CCC TGG ATG GGT CTG CTA GGT CCT ACC ATC CAG GCT GAG GTT 1302
Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr Ile Gln Ala Glu Val
85 90 95 

TAT GAT ACA GTG GTC ATT ACA CTT AAG AAC ATG GCT TCC CAT CCT GTC 1350
Tyr Asp Thr Val Val Ile Thr Leu Lys Asn Met Ala Ser His Pro Val
100 105 110 115 

AGT CTT CAT GCT GTT GGT GTA TCC TAC TGG AAA GCT TCT GAG GGA GCT 1398
Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser Glu Gly Ala
120 125 130

GAA TAT GAT GAT CAG ACC AGT CAA AGG GAG AAA GAA GAT GAT AAA GTC 1446
Glu Tyr Asp Asp Gln Thr Ser Gln Arg Glu Lys Glu Asp Asp Lys Val
135 140 145 

TTC CCT GGT GGA AGC CAT ACA TAT GTC TGG CAG GTC CTG AAA GAG AAT 1494
Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gln Val Leu Lys Glu Asn
150 155 160 

GGT CCA ATG GCC TCT GAC CCA CTG TGC CTT ACC TAC TCA TAT CTT TCT 1542
Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser Tyr Leu Ser
165 170 175 

CAT GTG GAC CTG GTA AAA GAC TTG AAT TCA GGC CTC ATT GGA GCC CTA 1590
His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu Ile Gly Ala Leu
180 185 190 195 

CTA GTA TGT AGA GAA GGG AGT CTG GCC AAG GAA AAG ACA CAG ACC TTG 1638
Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr Gln Thr Leu
200 205 210 

CAC AAA TTT ATA CTA CTT TTT GCT GTA TTT GAT GAA GGG AAA AGT TGG 1686 
His Lys Phe Ile Leu Leu Phe Ala Val Phe Asp Glu Gly Lys Ser Trp
215 220 225

CAC TCA GAA ACA AAG AAC TCC TTG ATG CAG GAT AGG GAT GCT GCA TCT 1734
His Ser Glu Thr Lys Asn Ser Leu Met Gln Asp Arg Asp Ala Ala Ser
230 235 240 

GCT CGG GCC TGG CCT AAA ATG CAC ACA GTC AAT GGT TAT GTA AAC AGG 1782 
Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr Val Asn Arg 
245 250 255 

TCT CTG CCA GGT CTG ATT GGA TGC CAC AGG AAA TCA GTC TAT TGG CAT 1830
Ser Leu Pro Gly Leu Ile Gly Cys His Arg Lys Ser Val Tyr Trp His
260 265 270 275

GTG ATT GGA ATG GGC ACC ACT CCT GAA GTG CAC TCA ATA TTC CTC GAA 1878 
Val Ile Gly Met Gly Thr Thr Pro Glu Val His Ser Ile Phe Leu Glu
280 285 290 

GGT CAC ACA TTT CTT GTG AGG AAC CAT CGC CAG GCG TCC TTG GAA ATC 1926
Gly His Thr Phe Leu Val Arg Asn His Arg Gln Ala Ser Leu Glu Ile
295 300 305 

TCG CCA ATA ACT TTC CTT ACT GCT CAA ACA CTC TTG ATG GAC CTT GGA 1974 
Ser Pro Ile Thr Phe Leu Thr Ala Gln Thr Leu Leu Met Asp Leu Gly
310 315 320 

CAG TTT CTA CTG TTT TGT CAT ATC TCT TCC CAC CAA CAT GAT GGC ATG 2022
Gln Phe Leu Leu Phe Cys His Ile Ser Ser His Gln His Asp Gly Met
325 330 335 

GAA GCT TAT GTC AAA GTA GAC AGC TGT CCA GAG GAA CCC CAA CTA CGA 2070
Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro Gln Leu Arg
340 345 350 355 

ATG AAA AAT AAT GAA GAA GCG GAA GAC TAT GAT GAT GAT CTT ACT GAT 2118 
Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp Leu Thr Asp 
360 365 370 

TCT GAA ATG GAT GTG GTC AGG TTT GAT GAT GAC AAC TCT CCT TCC TTT 2166
Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser Pro Ser Phe
375 380 385 

ATC CAA ATT CGC TCA GTT GCC AAG AAG CAT CCT AAA ACT TGG GTA CAT 2214 
Ile Gln Ile Arg Ser Val Ala Lys Lys His Pro Lys Thr Trp Val His
390 395 400 

TAC ATT GCT GCT GAA GAG GAG GAC TGG GAC TAT GCT CCC TTA GTC CTC 2262 
Tyr Ile Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro Leu Val Leu
405 410 415 

GCC CCC GAT GAC AGA AGT TAT AAA AGT CAA TAT TTG AAC AAT GGC CCT 2310
Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gln Tyr Leu Asn Asn Gly Pro
420 425 430 435 

CAG CGG ATT GGT AGG AAG TAC AAA AAA GTC CGA TTT ATG GCA TAC ACA 2358
Gln Arg Ile Gly Arg Lys Tyr Lys Lys Val Arg Phe Met Ala Tyr Thr
440 445 450

GAT GAA ACC TTT AAG ACT CGT GAA GCT ATT CAG CAT GAA TCA GGA ATC 2406
Asp Glu Thr Phe Lys Thr Arg Glu Ala Ile Gln His Glu Ser Gly Ile
455 460 465 

TTG GGA CCT TTA CTT TAT GGG GAA GTT GGA GAC ACA CTG TTG ATT ATA 2454
Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu Leu Ile Ile
470 475 480 

TTT AAG AAT CAA GCA AGC AGA CCA TAT AAC ATC TAC CCT CAC GGA ATC 2502
Phe Lys Asn Gln Ala Ser Arg Pro Tyr Asn Ile Tyr Pro His Gly Ile
485 490 495 

ACT GAT GTC CGT CCT TTG TAT TCA AGG AGA TTA CCA AAA GGT GTA AAA 25 50 
Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys Gly Val Lys 
500 505 510 515 

CAT TTG AAG GAT TTT CCA ATT CTG CCA GGA GAA ATA TTC AAA TAT AAA 2598
His Leu Lys Asp Phe Pro Ile Leu Pro Gly Glu Ile Phe Lys Tyr Lys
520 525 530 

TGG ACA GTG ACT GTA GAA GAT GGG CCA ACT AAA TCA GAT CCT CGG TGC 2646
Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp Pro Arg Cys
535 540 545 

CTG ACC CGC TAT TAC TCT AGT TTC GTT AAT ATG GAG AGA GAT CTA GCT 2694
Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg Asp Leu Ala
550 555 560 

TCA GGA CTC ATT GGC CCT CTC CTC ATC TGC TAC AAA GAA TCT GTA GAT 2742
Ser Gly Leu Ile Gly Pro Leu Leu Ile Cys Tyr Lys Glu Ser Val Asp
565 570 575 

CAA AGA GGA AAC CAG ATA ATG TCA GAC AAG AGG AAT GTC ATC CTG TTT 2790
Gln Arg Gly Asn Gln Ile Met Ser Asp Lys Arg Asn Val Ile Leu Phe
580 585 590 595 

TCT GTA TTT GAT GAG AAC CGA AGC TGG TAC CTC ACA GAG AAT ATA CAA 2838
Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu Asn Ile Gln
600 605 610 

CGC TTT CTC CCC AAT CCA GCT GGA GTG CAG CTT GAG GAT CCA GAG TTC 2886
Arg Phe Leu Pro Asn Pro Ala Gly Val Gln Leu Glu Asp Pro Glu Phe
615 620 625 

CAA GCC TCC AAC ATC ATG CAC AGC ATC AAT GGC TAT GTT TTT GAT AGT 2934
Gln Ala Ser Asn Ile Met His Ser Ile Asn Gly Tyr Val Phe Asp Ser
630 635 640 

TTG CAG TTG TCA GTT TGT TTG CAT GAG GTG GCA TAC TGG TAC ATT CTA 2982
Leu Gln Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp Tyr Ile Leu
645 650 655 

AGC ATT GGA GCA CAG ACT GAC TTC CTT TCT GTC TTC TTC TCT GGA TAT 3030
Ser Ile Gly Ala Gln Thr Asp Phe Leu Ser Val Phe Phe Ser Gly Tyr
660 665 670 675 

ACC TTC AAA CAC AAA ATG GTC TAT GAA GAC ACA CTC ACC CTA TTC CCA 3078
Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr Leu Phe Pro
680 685 690 

TTC TCA GGA GAA ACT GTC TTC ATG TCG ATG GAA AAC CCA GGT CTA TGG 3126
Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro Gly Leu Trp
695 700 705 

ATT CTG GGG TGC CAC AAC TCA GAC TTT CGG AAC AGA GGC ATG ACC GCC 3174 
Ile Leu Gly Cys His Asn Ser Asp Phe Arg Asn Arg Gly Met Thr Ala
710 715 720 

TTA CTG AAG GTT TCT AGT TGT GAC AAG AAC ACT GGT GAT TAT TAC GAG 3222
Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp Tyr Tyr Glu
725 730 735 

GAC AGT TAT GAA GAT ATT TCA GCA TAC TTG CTG AGT AAA AAC AAT GCC 3270
Asp Ser Tyr Glu Asp Ile Ser Ala Tyr Leu Leu Ser Lys Asn Asn Ala
740 745 750 755 

ATT GAA CCA AGA AGC TTC TCC CAG GTAAGTTATT ATATAAATTC AAGACACCCT 3324
Ile Glu Pro Arg Ser Phe Ser Gln
760 

AGCACTAGGC AAAAGCAATT TAATGCCACC ACAATTCCAG AAAATGACAT AGAGAAGACT 3384

GACCCTTGGT TTGCACACAG AACACCTATG CCTAAAATAC AAAATGTCTC CTCTAGTGAT 3444 

TTGTTGATGC TCTTGCGACA GAGTCCTACT CCACATGGGC TATCCTTATC TGATCTCCAA 3504

GAAGCCAAAT ATGAGACTTT TTCTGATGAT CCATCACCTG GAGCAATAGA CAGTAATAAC 3564

AGCCTGTCTG AAATGACACA CTTCAGGCCA CAGCTCCATC ACAGTGGGGA CATGGTATTT 3624

ACCCCTGAGT CAGGCCTCCA ATTAAGATTA AATGAGAAAC TGGGGACAAC TGCAGCAACA 3684

GAGTTGAAGA AACTTGATTT CAAAGTTTCT AGTACATCAA ATAATCTGAT TTCAACAATT 3744

CCATCAGACA ATTTGGCAGC AGGTACTGAT AATACAAGTT CCTTAGGACC CCCAAGTATG 3804

CCAGTTCATT ATGATAGTCA ATTAGATACC ACTCTATTTG GCAAAAAGTC ATCTCCCCTT 3864

ACTGAGTCTG GTGGACCTCT GAGCTTGAGT GAAGAAAATA ATGATTCAAA GTTGTTAGAA 3924

TCAGGTTTAA TGAATAGCCA AGAAAGTTCA TGGGGAAAAA ATGTATCGTC AACAGAGAGT 3984

GGTAGGTTAT TTAAAGGGAA AAGAGCTCAT GGACCTGCTT TGTTGACTAA AGATAATGCC 4044

TTATTCAAAG TTAGCATCTC TTTGTTAAAG ACAAACAAAA CTTCCAATAA TTCAGCAACT 4104 

AATAGAAAGA CTCACATTGA TGGCCCATCA TTATTAATTG AGAATAGTCC ATCAGTCTGG 4164 

CAAAATATAT TAGAAAGTGA CACTGAGTTT AAAAAAGTGA CACCTTTGAT TCATGACAGA 4224

ATGCTTATGG ACAAAAATGC TACAGCTTTG AGGCTAAATC ATATGTCAAA TAAAACTACT 4284

TCATCAAAAA ACATGGAAAT GGTCCAACAG AAAAAAGAGG GCCCCATTCC ACCAGATGCA 4344

CAAAATCCAG ATATGTCGTT CTTTAAGATG CTATTCTTGC CAGAATCAGC AAGGTGGATA 4404

CAAAGGACTC ATGGAAAGAA CTCTCTGAAC TCTGGGCAAG GCCCCAGTCC AAAGCAATTA 4464 

GTATCCTTAG GACCAGAAAA ATCTGTGGAA GGTCAGAATT TCTTGTCTGA GAAAAACAAA 4524

GTGGTAGTAG GAAAGGGTGA ATTTACAAAG GACGTAGGAC TCAAAGAGAT GGTTTTTCCA 4584

AGCAGCAGAA ACCTATTTCT TACTAACTTG GATAATTTAC ATGAAAATAA TACACACAAT 4644 

CAAGAAAAAA AAATTCAGGA AGAAATAGAA AAGAAGGAAA CATTAATCCA AGAGAATGTA 4704

GTTTTGCCTC AGATACATAC AGTGACTGGC ACTAAGAATT TCATGAAGAA CCTTTTCTTA 4764

CTGAGCACTA GGCAAAATGT AGAAGGTTCA TATGAGGGGG CATATGCTCC AGTACTTCAA 4824

GATTTTAGGT CATTAAATGA TTCAACAAAT AGAACAAAGA AACACACAGC TCATTTCTCA 4884

AAAAAAGGGG AGGAAGAAAA CTTGGAAGGC TTGGGAAATC AAACCAAGCA AATTGTAGAG 4944

AAATATGCAT GCACCACAAG GATATCTCCT AATACAAGCC AGCAGAATTT TGTCACGCAA 5004 

CGTAGTAAGA GAGCTTTGAA ACAATTCAGA CTCCCACTAG AAGAAACAGA ACTTGAAAAA 5064

AGGATAATTG TGGATGACAC CTCAACCCAG TGGTCCAAAA ACATGAAACA TTTGACCCCG 5124

AGCACCCTCA CACAGATAGA CTACAATGAG AAGGAGAAAG GGGCCATTAC TCAGTCTCCC 5184 

TTATCAGATT GCCTTACGAG GAGTCATAGC ATCCCTCAAG CAAATAGATC TCCATTACCC 5244 

ATTGCAAAGG TATCATCATT TCCATCTATT AGACCTATAT ATCTGACCAG GGTCCTATTC 5304

CAAGACAACT CTTCTCATCT TCCAGCAGCA TCTTATAGAA AGAAAGATTC TGGGGTCCAA 5364 

GAAAGCAGTC ATTTCTTACA AGGAGCCAAA AAAAATAACC TTTCTTTAGC CATTCTAACC 5424 

TTGGAGATGA CTGGTGATCA AAGAGAGGTT GGCTCCCTGG GGACAAGTGC CACAAATTCA 5484

GTCACATACA AGAAAGTTGA GAACACTGTT CTCCCGAAAC CAGACTTGCC CAAAACATCT 5544

GGCAAAGTTG AATTGCTTCC AAAAGTTCAC ATTTATCAGA AGGACCTATT CCCTACGGAA 5604

ACTAGCAATG GGTCTCCTGG CCATCTGGAT CTCGTGGAAG GGAGCCTTCT TCAGGGAACA 5664 

GAGGGAGCGA TTAAGTGGAA TGAAGCAAAC AGACCTGGAA AAGTTCCCTT TCTGAGAGTA 5724 

GCAACAGAAA GCTCTGCAAA GACTCCCTCC AAGCTATTGG ATCCTCTTGC TTGGGATAAC 5784

CACTATGGTA CTCAGATACC AAAAGAAGAG TGGAAATCCC AAGAGAAGTC ACCAGAAAAA 5844 

ACAGCTTTTA AGAAAAAGGA TACCATTTTG TCCCTGAACG CTTGTGAAAG CAATCATGCA 5904

ATAGCAGCAA TAAATGAGGG ACAAAATAAG CCCGAAATAG AAGTCACCTG GGCAAAGCAA 5964 

GGTAGGACTG AAAGGCTGTG CTCTCAATTG TGCTAATAAA GCTTGGCAAG AGTATTTCAA 6024

GGAAGATGAA GTCATTAACT ATGCAAAATG CTTCTCAGGC ACCTAGGAAA ATGAGGATGT 6084

GAGGCATTTC TACCCACTTG GTACATAAAA TTATTGGGTC ACCCTTTTCC TCTTCTTTTT 6144

TTCTCCAG AAC CCA CCA GTC TTG AAA CGC CAT CAA CGG GAA ATA ACT CGT 6194 
Asn Pro Pro Val Leu Lys Arg His Gln Arg Glu Ile Thr Arg
1 5 10

ACT ACT CTT CAG TCA GAT CAA GAG GAA ATT GAC TAT GAT GAT ACC ATA 6242 
Thr Thr Leu Gln Ser Asp Gln Glu Glu Ile Asp Tyr Asp Asp Thr Ile
15 20 25 30 

TCA GTT GAA ATG AAG AAG GAA GAT TTT GAC ATT TAT GAT GAG GAT GAA 6290
Ser Val Glu Met Lys Lys Glu Asp Phe Asp Ile Tyr Asp Glu Asp Glu
35 40 45 

AAT CAG AGC CCC CGC AGC TTT CAA AAG AAA ACA CGA CAC TAT TTT ATT 6338
Asn Gln Ser Pro Arg Ser Phe Gln Lys Lys Thr Arg His Tyr Phe Ile
50 55 60 

GCT GCA GTG GAG AGG CTC TGG GAT TAT GGG ATG AGT AGC TCC CCA CAT 6386
Ala Ala Val Glu Arg Leu Trp Asp Tyr Gly Met Ser Ser Ser Pro His
65 70 75 

GTT CTA AGA AAC AGG GCT CAG AGT GGC AGT GTC CCT CAG TTC AAG AAA 6434 
Val Leu Arg Asn Arg Ala Gln Ser Gly Ser Val Pro Gln Phe Lys Lys
80 85 90 

GTT GTT TTC CAG GAA TTT ACT GAT GGC TCC TTT ACT CAG CCC TTA TAC 6482
Val Val Phe Gln Glu Phe Thr Asp Gly Ser Phe Thr Gln Pro Leu Tyr
95 100 105 110 

CGT GGA GAA CTA AAT GAA CAT TTG GGA CTC CTG GGG CCA TAT ATA AGA 6530
Arg Gly Glu Leu Asn Glu His Leu Gly Leu Leu Gly Pro Tyr Ile Arg
115 120 125 

GCA GAA GTT GAA GAT AAT ATC ATG GTA ACT TTC AGA AAT CAG GCC TCT 6578
Ala Glu Val Glu Asp Asn Ile Met Val Thr Phe Arg Asn Gln Ala Ser
130 135 140 

CGT CCC TAT TCC TTC TAT TCT AGC CTT ATT TCT TAT GAG GAA GAT CAG 6626 
Arg Pro Tyr Ser Phe Tyr Ser Ser Leu Ile Ser Tyr Glu Glu Asp Gln
145 150 155 

AGG CAA GGA GCA GAA CCT AGA AAA AAC TTT GTC AAG CCT AAT GAA ACC 6674
Arg Gln Gly Ala Glu Pro Arg Lys Asn Phe Val Lys Pro Asn Glu Thr
160 165 170 

AAA ACT TAC TTT TGG AAA GTG CAA CAT CAT ATG GCA CCC ACT AAA GAT 6722
Lys Thr Tyr Phe Trp Lys Val Gln His His Met Ala Pro Thr Lys Asp
175 180 185 190 

GAG TTT GAC TGC AAA GCC TGG GCT TAT TTC TCT GAT GTT GAC CTG GAA 6770
Glu Phe Asp Cys Lys Ala Trp Ala Tyr Phe Ser Asp Val Asp Leu Glu
195 200 205 

AAA GAT GTG CAC TCA GGC CTG ATT GGA CCC CTT CTG GTC TGC CAC ACT 6818
Lys Asp Val His Ser Gly Leu Ile Gly Pro Leu Leu Val Cys His Thr
210 215 220

AAC ACA CTG AAC CCT GCT CAT GGG AGA CAA GTG ACA GTA CAG GAA TTT 6866 
Asn Thr Leu Asn Pro Ala His Gly Arg Gln Val Thr Val Gln Glu Phe
225 230 235 

GCT CTG TTT TTC ACC ATC TTT GAT GAG ACC AAA AGC TGG TAC TTC ACT 6914
Ala Leu Phe Phe Thr Ile Phe Asp Glu Thr Lys Ser Trp Tyr Phe Thr
240 245 250 

GAA AAT ATG GAA AGA AAC TGC AGG GCT CCC TGC AAT ATC CAG ATG GAA 6962
Glu Asn Met Glu Arg Asn Cys Arg Ala Pro Cys Asn Ile Gln Met Glu
255 260 265 270 

GAT CCC ACT TTT AAA GAG AAT TAT CGC TTC CAT GCA ATC AAT GGC TAC 7010
Asp Pro Thr Phe Lys Glu Asn Tyr Arg Phe His Ala Ile Asn Gly Tyr
275 280 285 

ATA ATG GAT ACA CTA CCT GGC TTA GTA ATG GCT CAG GAT CAA AGG ATT 7058
Ile Met Asp Thr Leu Pro Gly Leu Val Met Ala Gln Asp Gln Arg Ile
290 295 300 

CGA TGG TAT CTG CTC AGC ATG GGC AGC AAT GAA AAC ATC CAT TCT ATT 7106 
Arg Trp Tyr Leu Leu Ser Met Gly Ser Asn Glu Asn Ile His Ser Ile
305 310 315 

CAT TTC AGT GGA CAT GTG TTC ACT GTA CGA AAA AAA GAG GAG TAT AAA 7154 
His Phe Ser Gly His Val Phe Thr Val Arg Lys Lys Glu Glu Tyr Lys 
320 325 330 

ATG GCA CTG TAC AAT CTC TAT CCA GGT GTT TTT GAG ACA GTG GAA ATG 7202
Met Ala Leu Tyr Asn Leu Tyr Pro Gly Val Phe Glu Thr Val Glu Met
335 340 345 350 

TTA CCA TCC AAA GCT GGA ATT TGG CGG GTG GAA TGC CTT ATT GGC GAG 7250
Leu Pro Ser Lys Ala Gly Ile Trp Arg Val Glu Cys Leu Ile Gly Glu
355 360 365 

CAT CTA CAT GCT GGG ATG AGC ACA CTT TTT CTG GTG TAC AGC AAT AAG 7298 
His Leu His Ala Gly Met Ser Thr Leu Phe Leu Val Tyr Ser Asn Lys 
370 375 380 

TGT CAG ACT CCC CTG GGA ATG GCT TCT GGA CAC ATT AGA GAT TTT CAG 7346
Cys Gln Thr Pro Leu Gly Met Ala Ser Gly His Ile Arg Asp Phe Gln
385 390 395 

ATT ACA GCT TCA GGA CAA TAT GGA CAG TGG GCC CCA AAG CTG GCC AGA 7394 
Ile Thr Ala Ser Gly Gln Tyr Gly Gln Trp Ala Pro Lys Leu Ala Arg
400 405 410

CTT CAT TAT TCC GGA TCA ATC AAT GCC TGG AGC ACC AAG GAG CCC TTT 7442 
Leu His Tyr Ser Gly Ser Ile Asn Ala Trp Ser Thr Lys Glu Pro Phe
415 420 425 430 

TCT TGG ATC AAG GTG GAT CTG TTG GCA CCA ATG ATT ATT CAC GGC ATC 7490
Ser Trp Ile Lys Val Asp Leu Leu Ala Pro Met Ile Ile His Gly Ile
435 440 445

AAG ACC CAG GGT GCC CGT CAG AAG TTC TCC AGC CTC TAC ATC TCT CAG 7538
Lys Thr Gln Gly Ala Arg Gln Lys Phe Ser Ser Leu Tyr Ile Ser Gln
450 455 460 

TTT ATC ATC ATG TAT AGT CTT GAT GGG AAG AAG TGG CAG ACT TAT CGA 7586 
Phe Ile Ile Met Tyr Ser Leu Asp Gly Lys Lys Trp Gln Thr Tyr Arg
465 470 475 

GGA AAT TCC ACT GGA ACC TTA ATG GTC TTC TTT GGC AAT GTG GAT TCA 7634 
Gly Asn Ser Thr Gly Thr Leu Met Val Phe Phe Gly Asn Val Asp Ser 
480 485 490 

TCT GGG ATA AAA CAC AAT ATT TTT AAC CCT CCA ATT ATT GCT CGA TAC 7682
Ser Gly Ile Lys His Asn Ile Phe Asn Pro Pro Ile Ile Ala Arg Tyr
495 500 505 510 

ATC CGT TTG CAC CCA ACT CAT TAT AGC ATT CGC AGC ACT CTT CGC ATG 7730
Ile Arg Leu His Pro Thr His Tyr Ser Ile Arg Ser Thr Leu Arg Met
515 520 525 

GAG TTG ATG GGC TGT GAT TTA AAT AGT TGC AGC ATG CCA TTG GGA ATG 7778
Glu Leu Met Gly Cys Asp Leu Asn Ser Cys Ser Met Pro Leu Gly Met
530 535 540 

GAG AGT AAA GCA ATA TCA GAT GCA CAG ATT ACT GCT TCA TCC TAC TTT 7826
Glu Ser Lys Ala Ile Ser Asp Ala Gln Ile Thr Ala Ser Ser Tyr Phe
545 550 555 

ACC AAT ATG TTT GCC ACC TGG TCT CCT TCA AAA GCT CGA CTT CAC CTC 7874
Thr Asn Met Phe Ala Thr Trp Ser Pro Ser Lys Ala Arg Leu His Leu
560 565 570 

CAA GGG AGG AGT AAT GCC TGG AGA CCT CAG GTG AAT AAT CCA AAA GAG 7922
Gln Gly Arg Ser Asn Ala Trp Arg Pro Gln Val Asn Asn Pro Lys Glu
575 580 585 590 

TGG CTG CAA GTG GAC TTC CAG AAG ACA ATG AAA GTC ACA GGA GTA ACT 7970
Trp Leu Gln Val Asp Phe Gln Lys Thr Met Lys Val Thr Gly Val Thr
595 600 605 

ACT CAG GGA GTA AAA TCT CTG CTT ACC AGC ATG TAT GTG AAG GAG TTC 8018
Thr Gln Gly Val Lys Ser Leu Leu Thr Ser Met Tyr Val Lys Glu Phe
610 615 620 

CTC ATC TCC AGC AGT CAA GAT GGC CAT CAG TGG ACT CTC TTT TTT CAG 8066
Leu Ile Ser Ser Ser Gln Asp Gly His Gln Trp Thr Leu Phe Phe Gln
625 630 635 

AAT GGC AAA GTA AAG GTT TTT CAG GGA AAT CAA GAC TCC TTC ACA CCT 8114 
Asn Gly Lys Val Lys Val Phe Gln Gly Asn Gln Asp Ser Phe Thr Pro
640 645 650 

GTG GTG AAC TCT CTA GAC CCA CCG TTA CTG ACT CGC TAC CTT CGA ATT 8162 
Val Val Asn Ser Leu Asp Pro Pro Leu Leu Thr Arg Tyr Leu Arg Ile
655 660 665 670

CAC CCC CAG AGT TGG GTG CAC CAG ATT GCC CTG AGG ATG GAG GTT CTG 8210
His Pro Gln Ser Trp Val His Gln Ile Ala Leu Arg Met Glu Val Leu
675 680 685 

GGC TGC GAG GCA CAG GAC CTC TAC TGAGGGTGGC CACTGCAGCA CCTGCCACTG 8264
Gly Cys Glu Ala Gln Asp Leu Tyr
690 

CCGTCACCTC TCCCTCCTCA GCTCCAGGGC AGTGTCCCTC CCTGGCTTGC CTTCTACCTT 8324

TGTGCTAAAT CCTAGCAGAC ACTGCCTTGA AGCCTCCTGA ATTAACTATC ATCAGTCCTG 8384

CATTTCTTTG GTGGGGGGCC AGGAGGGTGC ATCCAATTTA ACTTAACTCT TACCTATTTT 8444 

CTGCAGCTGC TCCCAGATTA CTCCTTCCTT CCAATATAAC TAGGCAAAAA GAAGTGAGGA 8504

GAAACCTGCA TGAAAGCATT CTTCCCTGAA AAGTTAGGCC TCTCAGAGTC ACCACTTCCT 8564

CTGTTGTAGA AAAACTATGT GATGAAACTT TGAAAAAGAT ATTTATGATG TTAACTTGTT 8624 

TATTGCAGCT TATAATGGTT ACAAATAAAG CAATAGCATC ACAAATTTCA CAAATAAAGC 8684 

ATTTTTTTCA CTGCATTCTA GTTGTGGTTT GTCCAAACTC ATCAATGTAT CTTATCATGT 8744

CTGGATCCCC GGGTGGCATC CCTGTGACCC CTCCCCAGTG CCTCTCCTGG CCCTGGAAGT 8804

TGCCACTCCA GTGCCCACCA GCCTTGTCCT AATAAAATTA AGTTGCATCA TTTTGTCTGA 8864

CTAGGTGTCC TTCTATAATA TTATGGGGTG GAGGGGGGTG GTATGGAGCA AGGGGCAAGT 8924

TGGGAAGACA ACCTGTAGGG CCTGCGGGGT CTATTCGGGA ACCAAGCTGG AGTGCAGTGG 8984

CACAATCTTG GCTCACTGCA ATCTCCGCCT CCTGGGTTCA AGCGATTCTC CTGCCTCAGC 9044 

CTCCCGAGTT GTTGGGATTC CAGGCATGCA TGACCAGGCT CAGCTAATTT TTGTTTTTTT 9104 

GGTAGAGACG GGGTTTCACC ATATTGGCCA GGCTGGTCTC CAACTCCTAA TCTCAGGTGA 9164

TCTACCCACC TTGGCCTCCC AAATTGCTGG GATTACAGGC GTGAACCACT GCTCCCTTCC 9224

CTGTCCTTCT GATTTTAAAA TAACTATACC AGCAGGAGGA CGTCCAGACA CAGCATAGGC 9284

TACCTGCCAT GCCCAACCGG TGGGACATTT GAGTTGCTTG CTTGGCACTG TCCTCTCATG 9344 

CGTTGGGTCC ACTCAGTAGA TGCCTGTTGA ATTCGTAATC ATGGTCATAG CTGTTTCCTG 9404

TGTGAAATTG TTATCCGCTC ACAATTCCAC ACAACATACG AGCCGGAAGC ATAAAGTGTA 9464

AAGCCTGGGG TGCCTAATGA GTGAGCTAAC TCACATTAAT TGCGTTGCGC TCACTGCCCG 9524 

CTTTCCAGTC GGGAAACCTG TCGTGCCAGC TGCATTAATG AATCGGCCAA CGCGCGGGGA 9584 

GAGGCGGTTT GCGTATTGGG CGCTCTTCCG CTTCCTCGCT CACTGACTCG CTGCGCTCGG 9644 

TCGTTCGGCT GCGGCGAGCG GTATCAGCTC ACTCAAAGGC GGTAATACGG TTATCCACAG 9704

AATCAGGGGA TAACGCAGGA AAGAACATGT GAGCAAAAGG CCAGCAAAAG GCCAGGAACC 9764
GTAAAAAGGC CGCGTTGCTG GCGTTTTTCC ATAGGCTCCG CCCCCCTGAC GAGCATCACA 9824
AAAATCGACG CTCAAGTCAG AGGTGGCGAA ACCCGACAGG ACTATAAAGA TACCAGGCGT 9884
TTCCCCCTGG AAGCTCCCTC GTGCGCTCTC CTGTTCCGAC CCTGCCGCTT ACCGGATACC 9944 
TGTCCGCCTT TCTCCCTTCG GGAAGCGTGG CGCTTTCTCA TAGCTCACGC TGTAGGTATC 10004 
TCAGTTCGGT GTAGGTCGTT CGCTCCAAGC TGGGCTGTGT GCACGAACCC CCCGTTCAGC 10064 
CCGACCGCTG CGCCTTATCC GGTAACTATC GTCTTGAGTC CAACCCGGTA AGACACGACT 10124 
TATCGCCACT GGCAGCAGCC ACTGGTAACA GGATTAGCAG AGCGAGGTAT GTAGGCGGTG 10184 
CTACAGAGTT CTTGAAGTGG TGGCCTAACT ACGGCTACAC TAGAAGGACA GTATTTGGTA 10244 
TCTGCGCTCT GCTGAAGCCA GTTACCTTCG GAAAAAGAGT TGGTAGCTCT TGATCCGGCA 10304 
AACAAACCAC CGCTGGTAGC GGTGGTTTTT TTGTTTGCAA GCAGCAGATT ACGCGCAGAA 10364 
AAAAAGGATC TCAAGAAGAT CCTTTGATCT TTTCTACGGG GTCTGACGCT CAGTGGAACG 10424 
AAAACTCACG TTAAGGGATT TTGGTCATGA GATTATCAAA AAGGATCTTC ACCTAGATCC 10484
TTTTAAATTA AAAATGAAGT TTTAAATCAA TCTAAAGTAT ATATGAGTAA ACTTGGTCTG 10544
ACAGTTACCA ATGCTTAATC AGTGAGGCAC CTATCTCAGC GATCTGTCTA TTTCGTTCAT 10604 
CCATAGTTGC CTGACTCCCC GTCGTGTAGA TAACTACGAT ACGGGAGGGC TTACCATCTG 10664
GCCCCAGTGC TGCAATGATA CCGCGAGACC CACGCTCACC GGCTCCAGAT TTATCAGCAA 10724
TAAACCAGCC AGCCGGAAGG GCCGAGCGCA GAAGTGGTCC TGCAACTTTA TCCGCCTCCA 10784
TCCAGTCTAT TAATTGTTGC CGGGAAGCTA GAGTAAGTAG TTCGCCAGTT AATAGTTTGC 10844 
GCAACGTTGT TGCCATTGCT ACAGGCATCG TGGTGTCACG CTCGTCGTTT GGTATGGCTT 10904
CATTCAGCTC CGGTTCCCAA CGATCAAGGC GAGTTACATG ATCCCCCATG TTGTGCAAAA 10964 
AAGCGGTTAG CTCCTTCGGT CCTCCGATCG TTGTCAGAAG TAAGTTGGCC GCAGTGTTAT 11024 
CACTCATGGT TATGGCAGCA CTGCATAATT CTCTTACTGT CATGCCATCC GTAAGATGCT 11084 
TTTCTGTGAC TGGTGAGTAC TCAACCAAGT CATTCTGAGA ATAGTGTATG CGGCGACCGA 11144
GTTGCTCTTG CCCGGCGTCA ATACGGGATA ATACCGCGCC ACATAGCAGA ACTTTAAAAG 11204 
TGCTCATCAT TGGAAAACGT TCTTCGGGGC GAAAACTCTC AAGGATCTTA CCGCTGTTGA 11264
GATCCAGTTC GATGTAACCC ACTCGTGCAC CCAACTGATC TTCAGCATCT TTTACTTTCA 11324 
CCAGCGTTTC TGGGTGAGCA AAAACAGGAA GGCAAAATGC CGCAAAAAAG GGAATAAGGG 11384 
CGACACGGAA ATGTTGAATA CTCATACTCT TCCTTTTTCA ATATTATTGA AGCATTTATC 11444
AGGGTTATTG TCTCATGAGC GGATACATAT TTGAATGTAT TTAGAAAAAT AAACAAATAG 11504
GGGTTCCGCG CACATTTCCC CGAAAAGTGC CACCTGACGT CTAAGAAACC ATTATTATCA 11564 
TGACATTAAC CTATAAAAAT AGGCGTATCA CGAGGCCCTT TCGTCTCGCG CGTTTCGGTG 11624 
ATGACGGTGA AAACCTCTGA CACATGCAGC TCCCGGAGAC GGTCACAGCT TGTCTGTAAG 11684
CGGATGCCGG GAGCAGACAA GCCCGTCAGG GCGCGTCAGC GGGTGTTGGC GGGTGTCGGG 11744
GCTGGCTTAA CTATGCGGCA TCAGAGCAGA TTGTACTGAG AGTGCACCAT ATGCGGTGTG 11804
AAATACCGCA CAGATGCGTA AGGAGAAAAT ACCGCATCAG GCGCCATTCG CCATTCAGGC 11864
TGCGCAACTG TTGGGAAGGG CGATCGGTGC GGGCCTCTTC GCTATTACGC CAGCTGGCGA 11924 
AAGGGGGATG TGCTGCAAGG CGATTAAGTT GGGTAACGCC AGGGTTTTCC CAGTCACGAC 11984 
GTTGTAAAAC GACGGCCAGT GCCAAGCTTG GGCTGCAG 12022

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1006..8058

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

GTCGACGGTA TCGATAAGCT TGATATCGAA TTCCTGCAGC CCGGGGGATC CACTAGTACT 60

CGAGACCTAG GAGTTAATTT TTAAAAAGCA GTCAAAAGTC CAAGTGGCCC TTGCGAGCAT 120

TTACTCTCTC TGTTTGCTCT GGTTAATAAT CTCAGGAGCA CAAACATTCC TTACTAGTCC 180

TAGAAGTTAA TTTTTAAAAA GCAGTCAAAA GTCCAAGTGG CCCTTGCGAG CATTTACTCT 240

CTCTGTTTGC TCTGGTTAAT AATCTCAGGA GCACAAACAT TCCTTACTAG TTCTAGAGCG 300

GCCGCCAGTG TGCTGGAATT CGGCTTTTTT AGGGCTGGAA GCTACCTTTG ACATCATTTC 360 

CTCTGCGAAT GCATGTATAA TTTCTACAGA ACCTATTAGA AAGGATCACC CAGCCTCTGC 420

TTTTGTACAA CTTTCCCTTA AAAAACTGCC AATTCCACTG CTGTTTGGCC CAATAGTGAG 480

AACTTTTTCC TGCTGCCTCT TGGTGCTTTT GCCTATGGCC CCTATTCTGC CTGCTGAAGA 540

CACTCTTGCC AGCATGGACT TAAACCCCTC CAGCTCTGAC AATCCTCTTT CTCTTTTGTT 600

TTACATGAAG GGTCTGGCAG CCAAAGCAAT CACTCAAAGT TCAAACCTTA TCATTTTTTG 660

CTTTGTTCCT CTTGGCCTTG GTTTTGTACA TCAGCTTTGA AAATACCATC CCAGGGTTAA 720

TGCTGGGGTT AATTTATAAC TAAGAGTGCT CTAGTTTTGC AATACAGGAC ATGCTATAAA 780 

AATGGAAAGA TGTTGCTTTC TGAGAGATCT CGAGGAAGCT AACAACAAAG AACAACAAAC 840

AACAATCAGG TAAGTATCCT TTTTACAGCA CAACTTAATG AGACAGATAG AAACTGGTCT 900 

TGTAGAAACA GAGTAGTCGC CTGCTTTTCT GCCAGGTGCT GACTTCTCTC CCCTTCTCTT 960 

TTTTCCTTTT CTCAGGATAA CAAGAACGAA ACAATAACAG CCACC ATG GAA ATA 1014 

Met Glu Ile
1 

GAG CTC TCC ACC TGC TTC TTT CTG TGC CTT TTG CGA TTC TGC TTT AGT 1062
Glu Leu Ser Thr Cys Phe Phe Leu Cys Leu Leu Arg Phe Cys Phe Ser
5 10 15 

GCC ACC AGA AGA TAC TAC CTG GGT GCA GTG GAA CTG TCA TGG GAC TAT 1110 
Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser Trp Asp Tyr 
20 25 30 35 

ATG CAA AGT GAT CTC GGT GAG CTG CCT GTG GAC GCA AGA TTT CCT CCT 1158 
Met Gln Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg Phe Pro Pro
40 45 50

AGA GTG CCA AAA TCT TTT CCA TTC AAC ACC TCA GTC GTG TAC AAA AAG 1206
Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val Val Tyr Lys Lys
55 60 65 

ACT CTG TTT GTA GAA TTC ACG GTT CAC CTT TTC AAC ATC GCT AAG CCA 1254
Thr Leu Phe Val Glu Phe Thr Val His Leu Phe Asn Ile Ala Lys Pro
70 75 80 

AGG CCA CCC TGG ATG GGT CTG CTA GGT CCT ACC ATC CAG GCT GAG GTT 1302
Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr Ile Gln Ala Glu Val
85 90 95 

TAT GAT ACA GTG GTC ATT ACA CTT AAG AAC ATG GCT TCC CAT CCT GTC 1350
Tyr Asp Thr Val Val Ile Thr Leu Lys Asn Met Ala Ser His Pro Val
100 105 110 115 

AGT CTT CAT GCT GTT GGT GTA TCC TAC TGG AAA GCT TCT GAG GGA GCT 1398
Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser Glu Gly Ala
120 125 130 

GAA TAT GAT GAT CAG ACC AGT CAA AGG GAG AAA GAA GAT GAT AAA GTC 1446 
Glu Tyr Asp Asp Gln Thr Ser Gln Arg Glu Lys Glu Asp Asp Lys Val
135 140 145 

TTC CCT GGT GGA AGC CAT ACA TAT GTC TGG CAG GTC CTG AAA GAG AAT 1494
Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gln Val Leu Lys Glu Asn
150 155 160 

GGT CCA ATG GCC TCT GAC CCA CTG TGC CTT ACC TAC TCA TAT CTT TCT 1542 
Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser Tyr Leu Ser
165 170 175

CAT GTG GAC CTG GTA AAA GAC TTG AAT TCA GGC CTC ATT GGA GCC CTA 1590 
His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu Ile Gly Ala Leu
180 185 190 195 

CTA GTA TGT AGA GAA GGG AGT CTG GCC AAG GAA AAG ACA CAG ACC TTG 1638
Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr Gln Thr Leu
200 205 210 

CAC AAA TTT ATA CTA CTT TTT GCT GTA TTT GAT GAA GGG AAA AGT TGG 1686 
His Lys Phe Ile Leu Leu Phe Ala Val Phe Asp Glu Gly Lys Ser Trp
215 220 225 

CAC TCA GAA ACA AAG AAC TCC TTG ATG CAG GAT AGG GAT GCT GCA TCT 1734 
His Ser Glu Thr Lys Asn Ser Leu Met Gln Asp Arg Asp Ala Ala Ser
230 235 240 

GCT CGG GCC TGG CCT AAA ATG CAC ACA GTC AAT GGT TAT GTA AAC AGG 1782 
Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr Val Asn Arg 
245 250 255 

TCT CTG CCA GGT CTG ATT GGA TGC CAC AGG AAA TCA GTC TAT TGG CAT 1830
Ser Leu Pro Gly Leu Ile Gly Cys His Arg Lys Ser Val Tyr Trp His
260 265 270 275 

GTG ATT GGA ATG GGC ACC ACT CCT GAA GTG CAC TCA ATA TTC CTC GAA 1878
Val Ile Gly Met Gly Thr Thr Pro Glu Val His Ser Ile Phe Leu Glu
280 285 290 

GGT CAC ACA TTT CTT GTG AGG AAC CAT CGC CAG GCG TCC TTG GAA ATC 1926
Gly His Thr Phe Leu Val Arg Asn His Arg Gln Ala Ser Leu Glu Ile
295 300 305 

TCG CCA ATA ACT TTC CTT ACT GCT CAA ACA CTC TTG ATG GAC CTT GGA 1974
Ser Pro Ile Thr Phe Leu Thr Ala Gln Thr Leu Leu Met Asp Leu Gly
310 315 320 

CAG TTT CTA CTG TTT TGT CAT ATC TCT TCC CAC CAA CAT GAT GGC ATG 2022
Gln Phe Leu Leu Phe Cys His Ile Ser Ser His Gln His Asp Gly Met
325 330 335 

GAA GCT TAT GTC AAA GTA GAC AGC TGT CCA GAG GAA CCC CAA CTA CGA 2070
Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro Gln Leu Arg
340 345 350 355 

ATG AAA AAT AAT GAA GAA GCG GAA GAC TAT GAT GAT GAT CTT ACT GAT 2118 
Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp Leu Thr Asp 
360 365 370 

TCT GAA ATG GAT GTG GTC AGG TTT GAT GAT GAC AAC TCT CCT TCC TTT 2166 
Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser Pro Ser Phe 
375 380 385 

ATC CAA ATT CGC TCA GTT GCC AAG AAG CAT CCT AAA ACT TGG GTA CAT 2214 
Ile Gln Ile Arg Ser Val Ala Lys Lys His Pro Lys Thr Trp Val His
390 395 400

TAC ATT GCT GCT GAA GAG GAG GAC TGG GAC TAT GCT CCC TTA GTC CTC 2262
Tyr Ile Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro Leu Val Leu
405 410 415

GCC CCC GAT GAC AGA AGT TAT AAA AGT CAA TAT TTG AAC AAT GGC CCT 2310
Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gln Tyr Leu Asn Asn Gly Pro
420 425 430 435

CAG CGG ATT GGT AGG AAG TAC AAA AAA GTC CGA TTT ATG GCA TAC ACA 2358
Gln Arg Ile Gly Arg Lys Tyr Lys Lys Val Arg Phe Met Ala Tyr Thr
440 445 450

GAT GAA ACC TTT AAG ACT CGT GAA GCT ATT CAG CAT GAA TCA GGA ATC 2406
Asp Glu Thr Phe Lys Thr Arg Glu Ala Ile Gln His Glu Ser Gly Ile
455 460 465

TTG GGA CCT TTA CTT TAT GGG GAA GTT GGA GAC ACA CTG TTG ATT ATA 2454
Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu Leu Ile Ile
470 475 480

TTT AAG AAT CAA GCA AGC AGA CCA TAT AAC ATC TAC CCT CAC GGA ATC 2502
Phe Lys Asn Gln Ala Ser Arg Pro Tyr Asn Ile Tyr Pro His Gly Ile
485 490 495

ACT GAT GTC CGT CCT TTG TAT TCA AGG AGA TTA CCA AAA GGT GTA AAA 2550 
Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys Gly Val Lys 
500 505 510 515 

CAT TTG AAG GAT TTT CCA ATT CTG CCA GGA GAA ATA TTC AAA TAT AAA 2598
His Leu Lys Asp Phe Pro Ile Leu Pro Gly Glu Ile Phe Lys Tyr Lys
520 525 530

TGG ACA GTG ACT GTA GAA GAT GGG CCA ACT AAA TCA GAT CCT CGG TGC 2646 
Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp Pro Arg Cys 
535 540 545 

CTG ACC CGC TAT TAC TCT AGT TTC GTT AAT ATG GAG AGA GAT CTA GCT 2694
Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg Asp Leu Ala
550 555 560

TCA GGA CTC ATT GGC CCT CTC CTC ATC TGC TAC AAA GAA TCT GTA GAT 2742
Ser Gly Leu Ile Gly Pro Leu Leu Ile Cys Tyr Lys Glu Ser Val Asp
565 570 575

CAA AGA GGA AAC CAG ATA ATG TCA GAC AAG AGG AAT GTC ATC CTG TTT 2790
Gln Arg Gly Asn Gln Ile Met Ser Asp Lys Arg Asn Val Ile Leu Phe
580 585 590 595

TCT GTA TTT GAT GAG AAC CGA AGC TGG TAC CTC ACA GAG AAT ATA CAA 2838 
Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu Asn Ile Gln
600 605 610 

CGC TTT CTC CCC AAT CCA GCT GGA GTG CAG CTT GAG GAT CCA GAG TTC 2886 
Arg Phe Leu Pro Asn Pro Ala Gly Val Gln Leu Glu Asp Pro Glu Phe
615 620 625

CAA GCC TCC AAC ATC ATG CAC AGC ATC AAT GGC TAT GTT TTT GAT AGT 2934
Gln Ala Ser Asn Ile Met His Ser Ile Asn Gly Tyr Val Phe Asp Ser
630 635 640

TTG CAG TTG TCA GTT TGT TTG CAT GAG GTG GCA TAC TGG TAC ATT CTA 2982
Leu Gln Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp Tyr Ile Leu
645 650 655

AGC ATT GGA GCA CAG ACT GAC TTC CTT TCT GTC TTC TTC TCT GGA TAT 3030
Ser Ile Gly Ala Gln Thr Asp Phe Leu Ser Val Phe Phe Ser Gly Tyr
660 665 670 675

ACC TTC AAA CAC AAA ATG GTC TAT GAA GAC ACA CTC ACC CTA TTC CCA 3076
Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr Leu Phe Pro
680 685 690

TTC TCA GGA GAA ACT GTC TTC ATG TCG ATG GAA AAC CCA GGT CTA TGG 3126 
Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro Gly Leu Trp 
695 700 705 

ATT CTG GGG TGC CAC AAC TCA GAC TTT CGG AAC AGA GGC ATG ACC GCC 3174 
Ile Leu Gly Cys His Asn Ser Asp Phe Arg Asn Arg Gly Met Thr Ala
710 715 720

TTA CTG AAG GTT TCT AGT TGT GAC AAG AAC ACT GGT GAT TAT TAC GAG 3222 
Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp Tyr Tyr Glu 
725 730 735 

GAC AGT TAT GAA GAT ATT TCA GCA TAC TTG CTG AGT AAA AAC AAT GCC 3270
Asp Ser Tyr Glu Asp Ile Ser Ala Tyr Leu Leu Ser Lys Asn Asn Ala
740 745 750 755

ATT GAA CCA AGA AGC TTC TCC CAG AAT TCA AGA CAC CCT AGC ACT AGG 3318
Ile Glu Pro Arg Ser Phe Ser Gln Asn Ser Arg His Pro Ser Thr Arg
760 765 770

CAA AAG CAA TTT AAT GCC ACC ACA ATT CCA GAA AAT GAC ATA GAG AAG 3366
Gln Lys Gln Phe Asn Ala Thr Thr Ile Pro Glu Asn Asp Ile Glu Lys
775 780 785

ACT GAC CCT TGG TTT GCA CAC AGA ACA CCT ATG CCT AAA ATA CAA AAT 3414 
Thr Asp Pro Trp Phe Ala His Arg Thr Pro Met Pro Lys Ile Gln Asn
790 795 800 

GTC TCC TCT AGT GAT TTG TTG ATG CTC TTG CGA CAG AGT CCT ACT CCA 3462
Val Ser Ser Ser Asp Leu Leu Met Leu Leu Arg Gln Ser Pro Thr Pro
805 810 815

CAT GGG CTA TCC TTA TCT GAT CTC CAA GAA GCC AAA TAT GAG ACT TTT 3510
His Gly Leu Ser Leu Ser Asp Leu Gln Glu Ala Lys Tyr Glu Thr Phe
820 825 830 835

TCT GAT GAT CCA TCA CCT GGA GCA ATA GAC AGT AAT AAC AGC CTG TCT 3558 
Ser Asp Asp Pro Ser Pro Gly Ala Ile Asp Ser Asn Asn Ser Leu Ser
840 845 850 

GAA ATG ACA CAC TTC AGG CCA CAG CTC CAT CAC AGT GGG GAC ATG GTA 3606
Glu Met Thr His Phe Arg Pro Gln Leu His His Ser Gly Asp Met Val
855 860 865

TTT ACC CCT GAG TCA GGC CTC CAA TTA AGA TTA AAT GAG AAA CTG GGG 3654 
Phe Thr Pro Glu Ser Gly Leu Gln Leu Arg Leu Asn Glu Lys Leu Gly
870 875 880 

ACA ACT GCA GCA ACA GAG TTG AAG AAA CTT GAT TTC AAA GTT TCT AGT 3702
Thr Thr Ala Ala Thr Glu Leu Lys Lys Leu Asp Phe Lys Val Ser Ser
885 890 895

ACA TCA AAT AAT CTG ATT TCA ACA ATT CCA TCA GAC AAT TTG GCA GCA 3750
Thr Ser Asn Asn Leu Ile Ser Thr Ile Pro Ser Asp Asn Leu Ala Ala
900 905 910 915

GGT ACT GAT AAT ACA AGT TCC TTA GGA CCC CCA AGT ATG CCA GTT CAT 3798
Gly Thr Asp Asn Thr Ser Ser Leu Gly Pro Pro Ser Met Pro Val His
920 925 930

TAT GAT AGT CAA TTA GAT ACC ACT CTA TTT GGC AAA AAG TCA TCT CCC 3846
Tyr Asp Ser Gln Leu Asp Thr Thr Leu Phe Gly Lys Lys Ser Ser Pro
935 940 945

CTT ACT GAG TCT GGT GGA CCT CTG AGC TTG AGT GAA GAA AAT AAT GAT 3894
Leu Thr Glu Ser Gly Gly Pro Leu Ser Leu Ser Glu Glu Asn Asn Asp
950 955 960

TCA AAG TTG TTA GAA TCA GGT TTA ATG AAT AGC CAA GAA AGT TCA TGG 3942
Ser Lys Leu Leu Glu Ser Gly Leu Met Asn Ser Gln Glu Ser Ser Trp
965 970 975

GGA AAA AAT GTA TCG TCA ACA GAG AGT GGT AGG TTA TTT AAA GGG AAA 3990
Gly Lys Asn Val Ser Ser Thr Glu Ser Gly Arg Leu Phe Lys Gly Lys
980 985 990 995

AGA GCT CAT GGA CCT GCT TTG TTG ACT AAA GAT AAT GCC TTA TTC AAA 4038
Arg Ala His Gly Pro Ala Leu Leu Thr Lys Asp Asn Ala Leu Phe Lys
1000 1005 1010

GTT AGC ATC TCT TTG TTA AAG ACA AAC AAA ACT TCC AAT AAT TCA GCA 4086
Val Ser Ile Ser Leu Leu Lys Thr Asn Lys Thr Ser Asn Asn Ser Ala
1015 1020 1025

ACT AAT AGA AAG ACT CAC ATT GAT GGC CCA TCA TTA TTA ATT GAG AAT 4134 
Thr Asn Arg Lys Thr His Ile Asp Gly Pro Ser Leu Leu Ile Glu Asn
1030 1035 1040 

AGT CCA TCA GTC TGG CAA AAT ATA TTA GAA AGT GAC ACT GAG TTT AAA 4182 
Ser Pro Ser Val Trp Gln Asn Ile Leu Glu Ser Asp Thr Glu Phe Lys
1045 1050 1055 

AAA GTG ACA CCT TTG ATT CAT GAC AGA ATG CTT ATG GAC AAA AAT GCT 4230
Lys Val Thr Pro Leu Ile His Asp Arg Met Leu Met Asp Lys Asn Ala
1060 1065 1070 1075

ACA GCT TTG AGG CTA AAT CAT ATG TCA AAT AAA ACT ACT TCA TCA AAA 4278
Thr Ala Leu Arg Leu Asn His Met Ser Asn Lys Thr Thr Ser Ser Lys
1080 1085 1090

AAC ATG GAA ATG GTC CAA CAG AAA AAA GAG GGC CCC ATT CCA CCA GAT 4326
Asn Met Glu Met Val Gln Gln Lys Lys Glu Gly Pro Ile Pro Pro Asp
1095 1100 1105

GCA CAA AAT CCA GAT ATG TCG TTC TTT AAG ATG CTA TTC TTG CCA GAA 4374
Ala Gln Asn Pro Asp Met Ser Phe Phe Lys Met Leu Phe Leu Pro Glu
1110 1115 1120

TCA GCA AGG TGG ATA CAA AGG ACT CAT GGA AAG AAC TCT CTG AAC TCT 4422
Ser Ala Arg Trp Ile Gln Arg Thr His Gly Lys Asn Ser Leu Asn Ser
1125 1130 1135

GGG CAA GGC CCC AGT CCA AAG CAA TTA GTA TCC TTA GGA CCA GAA AAA 4470
Gly Gln Gly Pro Ser Pro Lys Gln Leu Val Ser Leu Gly Pro Glu Lys
1140 1145 1150 1155

TCT GTG GAA GGT CAG AAT TTC TTG TCT GAG AAA AAC AAA GTG GTA GTA 4518
Ser Val Glu Gly Gln Asn Phe Leu Ser Glu Lys Asn Lys Val Val Val
1160 1165 1170

GGA AAG GGT GAA TTT ACA AAG GAC GTA GGA CTC AAA GAG ATG GTT TTT 4566
Gly Lys Gly Glu Phe Thr Lys Asp Val Gly Leu Lys Glu Met Val Phe
1175 1180 1185

CCA AGC AGC AGA AAC CTA TTT CTT ACT AAC TTG GAT AAT TTA CAT GAA 4614 
Pro Ser Ser Arg Asn Leu Phe Leu Thr Asn Leu Asp Asn Leu His Glu 
1190 1195 1200 

AAT AAT ACA CAC AAT CAA GAA AAA AAA ATT CAG GAA GAA ATA GAA AAG 4662 
Asn Asn Thr His Asn Gln Glu Lys Lys Ile Gln Glu Glu Ile Glu Lys
1205 1210 1215 

AAG GAA ACA TTA ATC CAA GAG AAT GTA GTT TTG CCT CAG ATA CAT ACA 4710
Lys Glu Thr Leu Ile Gln Glu Asn Val Val Leu Pro Gln Ile His Thr
1220 1225 1230 1235

GTG ACT GGC ACT AAG AAT TTC ATG AAG AAC CTT TTC TTA CTG AGC ACT 4758
Val Thr Gly Thr Lys Asn Phe Met Lys Asn Leu Phe Leu Leu Ser Thr
1240 1245 1250

AGG CAA AAT GTA GAA GGT TCA TAT GAG GGG GCA TAT GCT CCA GTA CTT 4806
Arg Gln Asn Val Glu Gly Ser Tyr Glu Gly Ala Tyr Ala Pro Val Leu
1255 1260 1265

CAA GAT TTT AGG TCA TTA AAT GAT TCA ACA AAT AGA ACA AAG AAA CAC 4854
Gln Asp Phe Arg Ser Leu Asn Asp Ser Thr Asn Arg Thr Lys Lys His
1270 1275 1280

ACA GCT CAT TTC TCA AAA AAA GGG GAG GAA GAA AAC TTG GAA GGC TTG 4902
Thr Ala His Phe Ser Lys Lys Gly Glu Glu Glu Asn Leu Glu Gly Leu
1285 1290 1295

GGA AAT CAA ACC AAG CAA ATT GTA GAG AAA TAT GCA TGC ACC ACA AGG 4950
Gly Asn Gln Thr Lys Gln Ile Val Glu Lys Tyr Ala Cys Thr Thr Arg
1300 1305 1310 1315

ATA TCT CCT AAT ACA AGC CAG CAG AAT TTT GTC ACG CAA CGT AGT AAG 4998
Ile Ser Pro Asn Thr Ser Gln Gln Asn Phe Val Thr Gln Arg Ser Lys
1320 1325 1330

AGA GCT TTG AAA CAA TTC AGA CTC CCA CTA GAA GAA ACA GAA CTT GAA 5046
Arg Ala Leu Lys Gln Phe Arg Leu Pro Leu Glu Glu Thr Glu Leu Glu
1335 1340 1345

AAA AGG ATA ATT GTG GAT GAC ACC TCA ACC CAG TGG TCC AAA AAC ATG 5094 
Lys Arg Ile Ile Val Asp Asp Thr Ser Thr Gln Trp Ser Lys Asn Met
1350 1355 1360 

AAA CAT TTG ACC CCG AGC ACC CTC ACA CAG ATA GAC TAC AAT GAG AAG 5142 
Lys His Leu Thr Pro Ser Thr Leu Thr Gln Ile Asp Tyr Asn Glu Lys
1365 1370 1375 

GAG AAA GGG GCC ATT ACT CAG TCT CCC TTA TCA GAT TGC CTT ACG AGG 5190 
Glu Lys Gly Ala Ile Thr Gln Ser Pro Leu Ser Asp Cys Leu Thr Arg
1380 1385 1390 1395 

AGT CAT AGC ATC CCT CAA GCA AAT AGA TCT CCA TTA CCC ATT GCA AAG 5238
Ser His Ser Ile Pro Gln Ala Asn Arg Ser Pro Leu Pro Ile Ala Lys
1400 1405 1410

GTA TCA TCA TTT CCA TCT ATT AGA CCT ATA TAT CTG ACC AGG GTC CTA 5286
Val Ser Ser Phe Pro Ser Ile Arg Pro Ile Tyr Leu Thr Arg Val Leu
1415 1420 1425

TTC CAA GAC AAC TCT TCT CAT CTT CCA GCA GCA TCT TAT AGA AAG AAA 5334
Phe Gln Asp Asn Ser Ser His Leu Pro Ala Ala Ser Tyr Arg Lys Lys
1430 1435 1440

GAT TCT GGG GTC CAA GAA AGC AGT CAT TTC TTA CAA GGA GCC AAA AAA 5382
Asp Ser Gly Val Gln Glu Ser Ser His Phe Leu Gln Gly Ala Lys Lys
1445 1450 1455

AAT AAC CTT TCT TTA GCC ATT CTA ACC TTG GAG ATG ACT GGT GAT CAA 5430
Asn Asn Leu Ser Leu Ala Ile Leu Thr Leu Glu Met Thr Gly Asp Gln
1460 1465 1470 1475

AGA GAG GTT GGC TCC CTG GGG ACA AGT GCC ACA AAT TCA GTC ACA TAC 5478
Arg Glu Val Gly Ser Leu Gly Thr Ser Ala Thr Asn Ser Val Thr Tyr
1480 1485 1490

AAG AAA GTT GAG AAC ACT GTT CTC CCG AAA CCA GAC TTG CCC AAA ACA 5526
Lys Lys Val Glu Asn Thr Val Leu Pro Lys Pro Asp Leu Pro Lys Thr
1495 1500 1505

TCT GGC AAA GTT GAA TTG CTT CCA AAA GTT CAC ATT TAT CAG AAG GAC 5574
Ser Gly Lys Val Glu Leu Leu Pro Lys Val His Ile Tyr Gln Lys Asp
1510 1515 1520

CTA TTC CCT ACG GAA ACT AGC AAT GGG TCT CCT GGC CAT CTG GAT CTC 5622
Leu Phe Pro Thr Glu Thr Ser Asn Gly Ser Pro Gly His Leu Asp Leu
1525 1530 1535

GTG GAA GGG AGC CTT CTT CAG GGA ACA GAG GGA GCG ATT AAG TGG AAT 5670
Val Glu Gly Ser Leu Leu Gln Gly Thr Glu Gly Ala Ile Lys Trp Asn
1540 1545 1550 1555

GAA GCA AAC AGA CCT GGA AAA GTT CCC TTT CTG AGA GTA GCA ACA GAA 5718
Glu Ala Asn Arg Pro Gly Lys Val Pro Phe Leu Arg Val Ala Thr Glu
1560 1565 1570

AGC TCT GCA AAG ACT CCC TCC AAG CTA TTG GAT CCT CTT GCT TGG GAT 5766 
Ser Ser Ala Lys Thr Pro Ser Lys Leu Leu Asp Pro Leu Ala Trp Asp 
1575 1580 1585 

AAC CAC TAT GGT ACT CAG ATA CCA AAA GAA GAG TGG AAA TCC CAA GAG 5814 
Asn His Tyr Gly Thr Gln Ile Pro Lys Glu Glu Trp Lys Ser Gln Glu
1590 1595 1600 

AAG TCA CCA GAA AAA ACA GCT TTT AAG AAA AAG GAT ACC ATT TTG TCC 5862 
Lys Ser Pro Glu Lys Thr Ala Phe Lys Lys Lys Asp Thr Ile Leu Ser
1605 1610 1615 

CTG AAC GCT TGT GAA AGC AAT CAT GCA ATA GCA GCA ATA AAT GAG GGA 5910
Leu Asn Ala Cys Glu Ser Asn His Ala Ile Ala Ala Ile Asn Glu Gly
1620 1625 1630 1635

CAA AAT AAG CCC GAA ATA GAA GTC ACC TGG GCA AAG CAA GGT AGG ACT 5958
Gln Asn Lys Pro Glu Ile Glu Val Thr Trp Ala Lys Gln Gly Arg Thr
1640 1645 1650

GAA AGG CTG TGC TCT CAA AAC CCA CCA GTC TTG AAA CGC CAT CAA CGG 6006 
Glu Arg Leu Cys Ser Gln Asn Pro Pro Val Leu Lys Arg His Gln Arg
1655 1660 1665 

GAA ATA ACT CGT ACT ACT CTT CAG TCA GAT CAA GAG GAA ATT GAC TAT 6054 
Glu Ile Thr Arg Thr Thr Leu Gln Ser Asp Gln Glu Glu Ile Asp Tyr
1670 1675 1680 

GAT GAT ACC ATA TCA GTT GAA ATG AAG AAG GAA GAT TTT GAC ATT TAT 6102 
Asp Asp Thr Ile Ser Val Glu Met Lys Lys Glu Asp Phe Asp Ile Tyr
1685 1690 1695 

GAT GAG GAT GAA AAT CAG AGC CCC CGC AGC TTT CAA AAG AAA ACA CGA 6150
Asp Glu Asp Glu Asn Gln Ser Pro Arg Ser Phe Gln Lys Lys Thr Arg
1700 1705 1710 1715

CAC TAT TTT ATT GCT GCA GTG GAG AGG CTC TGG GAT TAT GGG ATG AGT 6198 
His Tyr Phe Ile Ala Ala Val Glu Arg Leu Trp Asp Tyr Gly Met Ser
1720 1725 1730 

AGC TCC CCA CAT GTT CTA AGA AAC AGG GCT CAG AGT GGC AGT GTC CCT 6246 
Ser Ser Pro His Val Leu Arg Asn Arg Ala Gln Ser Gly Ser Val Pro
1735 1740 1745 

CAG TTC AAG AAA GTT GTT TTC CAG GAA TTT ACT GAT GGC TCC TTT ACT 6294
Gln Phe Lys Lys Val Val Phe Gln Glu Phe Thr Asp Gly Ser Phe Thr
1750 1755 1760

CAG CCC TTA TAC CGT GGA GAA CTA AAT GAA CAT TTG GGA CTC CTG GGG 6342
Gln Pro Leu Tyr Arg Gly Glu Leu Asn Glu His Leu Gly Leu Leu Gly
1765 1770 1775

CCA TAT ATA AGA GCA GAA GTT GAA GAT AAT ATC ATG GTA ACT TTC AGA 6390
Pro Tyr Ile Arg Ala Glu Val Glu Asp Asn Ile Met Val Thr Phe Arg
1780 1785 1790 1795

AAT CAG GCC TCT CGT CCC TAT TCC TTC TAT TCT AGC CTT ATT TCT TAT 6438
Asn Gln Ala Ser Arg Pro Tyr Ser Phe Tyr Ser Ser Leu Ile Ser Tyr
1800 1805 1810

GAG GAA GAT CAG AGG CAA GGA GCA GAA CCT AGA AAA AAC TTT GTC AAG 6486
Glu Glu Asp Gln Arg Gln Gly Ala Glu Pro Arg Lys Asn Phe Val Lys
1815 1820 1825

CCT AAT GAA ACC AAA ACT TAC TTT TGG AAA GTG CAA CAT CAT ATG GCA 6534
Pro Asn Glu Thr Lys Thr Tyr Phe Trp Lys Val Gln His His Met Ala
1830 1835 1840

CCC ACT AAA GAT GAG TTT GAC TGC AAA GCC TGG GCT TAT TTC TCT GAT 6582 
Pro Thr Lys Asp Glu Phe Asp Cys Lys Ala Trp Ala Tyr Phe Ser Asp 
1845 1850 1855 

GTT GAC CTG GAA AAA GAT GTG CAC TCA GGC CTG ATT GGA CCC CTT CTG 6630
Val Asp Leu Glu Lys Asp Val His Ser Gly Leu Ile Gly Pro Leu Leu
1860 1865 1870 1875

GTC TGC CAC ACT AAC ACA CTG AAC CCT GCT CAT GGG AGA CAA GTG ACA 6678
Val Cys His Thr Asn Thr Leu Asn Pro Ala His Gly Arg Gln Val Thr
1880 1885 1890

GTA CAG GAA TTT GCT CTG TTT TTC ACC ATC TTT GAT GAG ACC AAA AGC 6726 
Val Gln Glu Phe Ala Leu Phe Phe Thr Ile Phe Asp Glu Thr Lys Ser
1895 1900 1905 

TGG TAC TTC ACT GAA AAT ATG GAA AGA AAC TGC AGG GCT CCC TGC AAT 6774
Trp Tyr Phe Thr Glu Asn Met Glu Arg Asn Cys Arg Ala Pro Cys Asn
1910 1915 1920

ATC CAG ATG GAA GAT CCC ACT TTT AAA GAG AAT TAT CGC TTC CAT GCA 6822
Ile Gln Met Glu Asp Pro Thr Phe Lys Glu Asn Tyr Arg Phe His Ala
1925 1930 1935

ATC AAT GGC TAC ATA ATG GAT ACA CTA CCT GGC TTA GTA ATG GCT CAG 6870
Ile Asn Gly Tyr Ile Met Asp Thr Leu Pro Gly Leu Val Met Ala Gln
1940 1945 1950 1955

GAT CAA AGG ATT CGA TGG TAT CTG CTC AGC ATG GGC AGC AAT GAA AAC 6918
Asp Gln Arg Ile Arg Trp Tyr Leu Leu Ser Met Gly Ser Asn Glu Asn
1960 1965 1970

ATC CAT TCT ATT CAT TTC AGT GGA CAT GTG TTC ACT GTA CGA AAA AAA 6966
Ile His Ser Ile His Phe Ser Gly His Val Phe Thr Val Arg Lys Lys
1975 1980 1985

GAG GAG TAT AAA ATG GCA CTG TAC AAT CTC TAT CCA GGT GTT TTT GAG 7014
Glu Glu Tyr Lys Met Ala Leu Tyr Asn Leu Tyr Pro Gly Val Phe Glu
1990 1995 2000

ACA GTG GAA ATG TTA CCA TCC AAA GCT GGA ATT TGG CGG GTG GAA TGC 7062 
Thr Val Glu Met Leu Pro Ser Lys Ala Gly Ile Trp Arg Val Glu Cys
2005 2010 2015 

CTT ATT GGC GAG CAT CTA CAT GCT GGG ATG AGC ACA CTT TTT CTG GTG 7110 
Leu Ile Gly Glu His Leu His Ala Gly Met Ser Thr Leu Phe Leu Val
2020 2025 2030 2035 

TAC AGC AAT AAG TGT CAG ACT CCC CTG GGA ATG GCT TCT GGA CAC ATT 7158 
Tyr Ser Asn Lys Cys Gln Thr Pro Leu Gly Met Ala Ser Gly His Ile
2040 2045 2050 

AGA GAT TTT CAG ATT ACA GCT TCA GGA CAA TAT GGA CAG TGG GCC CCA 7206
Arg Asp Phe Gln Ile Thr Ala Ser Gly Gln Tyr Gly Gln Trp Ala Pro
2055 2060 2065

AAG CTG GCC AGA CTT CAT TAT TCC GGA TCA ATC AAT GCC TGG AGC ACC 7254
Lys Leu Ala Arg Leu His Tyr Ser Gly Ser Ile Asn Ala Trp Ser Thr
2070 2075 2080

AAG GAG CCC TTT TCT TGG ATC AAG GTG GAT CTG TTG GCA CCA ATG ATT 7302
Lys Glu Pro Phe Ser Trp Ile Lys Val Asp Leu Leu Ala Pro Met Ile
2085 2090 2095

ATT CAC GGC ATC AAG ACC CAG GGT GCC CGT CAG AAG TTC TCC AGC CTC 7350
Ile His Gly Ile Lys Thr Gln Gly Ala Arg Gln Lys Phe Ser Ser Leu
2100 2105 2110 2115

TAC ATC TCT CAG TTT ATC ATC ATG TAT AGT CTT GAT GGG AAG AAG TGG 7398
Tyr Ile Ser Gln Phe Ile Ile Met Tyr Ser Leu Asp Gly Lys Lys Trp
2120 2125 2130

CAG ACT TAT CGA GGA AAT TCC ACT GGA ACC TTA ATG GTC TTC TTT GGC 7446 
Gln Thr Tyr Arg Gly Asn Ser Thr Gly Thr Leu Met Val Phe Phe Gly
2135 2140 2145 

AAT GTG GAT TCA TCT GGG ATA AAA CAC AAT ATT TTT AAC CCT CCA ATT 7494
Asn Val Asp Ser Ser Gly Ile Lys His Asn Ile Phe Asn Pro Pro Ile
2150 2155 2160

ATT GCT CGA TAC ATC CGT TTG CAC CCA ACT CAT TAT AGC ATT CGC AGC 7542 
Ile Ala Arg Tyr Ile Arg Leu His Pro Thr His Tyr Ser Ile Arg Ser
2165 2170 2175 

ACT CTT CGC ATG GAG TTG ATG GGC TGT GAT TTA AAT AGT TGC AGC ATG 7590
Thr Leu Arg Met Glu Leu Met Gly Cys Asp Leu Asn Ser Cys Ser Met
2180 2185 2190 2195

CCA TTG GGA ATG GAG AGT AAA GCA ATA TCA GAT GCA CAG ATT ACT GCT 7638
Pro Leu Gly Met Glu Ser Lys Ala Ile Ser Asp Ala Gln Ile Thr Ala
2200 2205 2210

TCA TCC TAC TTT ACC AAT ATG TTT GCC ACC TGG TCT CCT TCA AAA GCT 7686
Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr Trp Ser Pro Ser Lys Ala
2215 2220 2225

CGA CTT CAC CTC CAA GGG AGG AGT AAT GCC TGG AGA CCT CAG GTG AAT 7734
Arg Leu His Leu Gln Gly Arg Ser Asn Ala Trp Arg Pro Gln Val Asn
2230 2235 2240

AAT CCA AAA GAG TGG CTG CAA GTG GAC TTC CAG AAG ACA ATG AAA GTC 7782 
Asn Pro Lys Glu Trp Leu Gln Val Asp Phe Gln Lys Thr Met Lys Val
2245 2250 2255 

ACA GGA GTA ACT ACT CAG GGA GTA AAA TCT CTG CTT ACC AGC ATG TAT 7830
Thr Gly Val Thr Thr Gln Gly Val Lys Ser Leu Leu Thr Ser Met Tyr
2260 2265 2270 2275

GTG AAG GAG TTC CTC ATC TCC AGC AGT CAA GAT GGC CAT CAG TGG ACT 7878
Val Lys Glu Phe Leu Ile Ser Ser Ser Gln Asp Gly His Gln Trp Thr
2280 2285 2290

CTC TTT TTT CAG AAT GGC AAA GTA AAG GTT TTT CAG GGA AAT CAA GAC 7926
Leu Phe Phe Gln Asn Gly Lys Val Lys Val Phe Gln Gly Asn Gln Asp
2295 2300 2305

TCC TTC ACA CCT GTG GTG AAC TCT CTA GAC CCA CCG TTA CTG ACT CGC 7974
Ser Phe Thr Pro Val Val Asn Ser Leu Asp Pro Pro Leu Leu Thr Arg
2310 2315 2320

TAC CTT CGA ATT CAC CCC CAG AGT TGG GTG CAC CAG ATT GCC CTG AGG 8022
Tyr Leu Arg Ile His Pro Gln Ser Trp Val His Gln Ile Ala Leu Arg
2325 2330 2335

ATG GAG GTT CTG GGC TGC GAG GCA CAG GAC CTC TAC TGAGGGTGGC 8068
Met Glu Val Leu Gly Cys Glu Ala Gln Asp Leu Tyr
2340 2345 2350

CACTGCAGCA CCTGCCACTG CCGTCACCTC TCCCTCCTCA GCTCCAGGGC AGTGTCCCTC 8128
CCTGGCTTGC CTTCTACCTT TGTGCTAAAT CCTAGCAGAC ACTGCCTTGA AGCCTCCTGA 8188
ATTAACTATC ATCAGTCCTG CATTTCTTTG GTGGGGGGCC AGGAGGGTGC ATCCAATTTA 8248
ACTTAACTCT TACCTATTTT CTGCAGCTGC TCCCAGATTA CTCCTTCCTT CCAATATAAC 8308
TAGGCAAAAA GAAGTGAGGA GAAACCTGCA TGAAAGCATT CTTCCCTGAA AAGTTAGGCC 8368
TCTCAGAGTC ACCACTTCCT CTGTTGTAGA AAAACTATGT GATGAAACTT TGAAAAAGAT 8428
ATTTATGATG TTAACTTGTT TATTGCAGCT TATAATGGTT ACAAATAAAG CAATAGCATC 8488
ACAAATTTCA CAAATAAAGC ATTTTTTTCA CTGCATTCTA GTTGTGGTTT GTCCAAACTC 8548
ATCAATGTAT CTTATCATGT CTGGATCCCC GGGTGGCATC CCTGTGACCC CTCCCCAGTG 8608
CCTCTCCTGG CCCTGGAAGT TGCCACTCCA GTGCCCACCA GCCTTGTCCT AATAAAATTA 8668
AGTTGCATCA TTTTGTCTGA CTAGGTGTCC TTCTATAATA TTATGGGGTG GAGGGGGGTG 8728
GTATGGAGCA AGGGGCAAGT TGGGAAGACA ACCTGTAGGG CCTGCGGGGT CTATTCGGGA 8788
ACCAAGCTGG AGTGCAGTGG CACAATCTTG GCTCACTGCA ATCTCCGCCT CCTGGGTTCA 8848
AGCGATTCTC CTGCCTCAGC CTCCCGAGTT GTTGGGATTC CAGGCATGCA TGACCAGGCT 8908
CAGCTAATTT TTGTTTTTTT GGTAGAGACG GGGTTTCACC ATATTGGCCA GGCTGGTCTC 8968
CAACTCCTAA TCTCAGGTGA TCTACCCACC TTGGCCTCCC AAATTGCTGG GATTACAGGC 9028
GTGAACCACT GCTCCCTTCC CTGTCCTTCT GATTTTAAAA TAACTATACC AGCAGGAGGA 9088
CGTCCAGACA CAGCATAGGC TACCTGCCAT GCCCAACCGG TGGGACATTT GAGTTGCTTG 9148
CTTGGCACTG TCCTCTCATG CGTTGGGTCC ACTCAGTAGA TGCCTGTTGA ATTCGTAATC 9208
ATGGTCATAG CTGTTTCCTG TGTGAAATTG TTATCCGCTC ACAATTCCAC ACAACATACG 9268
AGCCGGAAGC ATAAAGTGTA AAGCCTGGGG TGCCTAATGA GTGAGCTAAC TCACATTAAT 9328
TGCGTTGCGC TCACTGCCCG CTTTCCAGTC GGGAAACCTG TCGTGCCAGC TGCATTAATG 9388
AATCGGCCAA CGCGCGGGGA GAGGCGGTTT GCGTATTGGG CGCTCTTCCG CTTCCTCGCT 9448
CACTGACTCG CTGCGCTCGG TCGTTCGGCT GCGGCGAGCG GTATCAGCTC ACTCAAAGGC 9508
GGTAATACGG TTATCCACAG AATCAGGGGA TAACGCAGGA AAGAACATGT GAGCAAAAGG 9568
CCAGCAAAAG GCCAGGAACC GTAAAAAGGC CGCGTTGCTG GCGTTTTTCC ATAGGCTCCG 9628
CCCCCCTGAC GAGCATCACA AAAATCGACG CTCAAGTCAG AGGTGGCGAA ACCCGACAGG 9688
ACTATAAAGA TACCAGGCGT TTCCCCCTGG AAGCTCCCTC GTGCGCTCTC CTGTTCCGAC 9748
CCTGCCGCTT ACCGGATACC TGTCCGCCTT TCTCCCTTCG GGAAGCGTGG CGCTTTCTCA 9808 
TAGCTCACGC TGTAGGTATC TCAGTTCGGT GTAGGTCGTT CGCTCCAAGC TGGGCTGTGT 9868 
GCACGAACCC CCCGTTCAGC CCGACCGCTG CGCCTTATCC GGTAACTATC GTCTTGAGTC 9928
CAACCCGGTA AGACACGACT TATCGCCACT GGCAGCAGCC ACTGGTAACA GGATTAGCAG 9988 
AGCGAGGTAT GTAGGCGGTG CTACAGAGTT CTTGAAGTGG TGGCCTAACT ACGGCTACAC 10048 
TAGAAGGACA GTATTTGGTA TCTGCGCTCT GCTGAAGCCA GTTACCTTCG GAAAAAGAGT 10108 
TGGTAGCTCT TGATCCGGCA AACAAACCAC CGCTGGTAGC GGTGGTTTTT TTGTTTGCAA 10168
GCAGCAGATT ACGCGCAGAA AAAAAGGATC TCAAGAAGAT CCTTTGATCT TTTCTACGGG 10228 
GTCTGACGCT CAGTGGAACG AAAACTCACG TTAAGGGATT TTGGTCATGA GATTATCAAA 10288
AAGGATCTTC ACCTAGATCC TTTTAAATTA AAAATGAAGT TTTAAATCAA TCTAAAGTAT 10348 
ATATGAGTAA ACTTGGTCTG ACAGTTACCA ATGCTTAATC AGTGAGGCAC CTATCTCAGC 10408 
GATCTGTCTA TTTCGTTCAT CCATAGTTGC CTGACTCCCC GTCGTGTAGA TAACTACGAT 10468 
ACGGGAGGGC TTACCATCTG GCCCCAGTGC TGCAATGATA CCGCGAGACC CACGCTCACC 10528
GGCTCCAGAT TTATCAGCAA TAAACCAGCC AGCCGGAAGG GCCGAGCGCA GAAGTGGTCC 10588 
TGCAACTTTA TCCGCCTCCA TCCAGTCTAT TAATTGTTGC CGGGAAGCTA GAGTAAGTAG 10648
TTCGCCAGTT AATAGTTTGC GCAACGTTGT TGCCATTGCT ACAGGCATCG TGGTGTCACG 10708
CTCGTCGTTT GGTATGGCTT CATTCAGCTC CGGTTCCCAA CGATCAAGGC GAGTTACATG 10768 
ATCCCCCATG TTGTGCAAAA AAGCGGTTAG CTCCTTCGGT CCTCCGATCG TTGTCAGAAG 10828 
TAAGTTGGCC GCAGTGTTAT CACTCATGGT TATGGCAGCA CTGCATAATT CTCTTACTGT 10888 
CATGCCATCC GTAAGATGCT TTTCTGTGAC TGGTGAGTAC TCAACCAAGT CATTCTGAGA 10948
ATAGTGTATG CGGCGACCGA GTTGCTCTTG CCCGGCGTCA ATACGGGATA ATACCGCGCC 11008 
ACATAGCAGA ACTTTAAAAG TGCTCATCAT TGGAAAACGT TCTTCGGGGC GAAAACTCTC 11068 
AAGGATCTTA CCGCTGTTGA GATCCAGTTC GATGTAACCC ACTCGTGCAC CCAACTGATC 11128
TTCAGCATCT TTTACTTTCA CCAGCGTTTC TGGGTGAGCA AAAACAGGAA GGCAAAATGC 11188
CGCAAAAAAG GGAATAAGGG CGACACGGAA ATGTTGAATA CTCATACTCT TCCTTTTTCA 11248
ATATTATTGA AGCATTTATC AGGGTTATTG TCTCATGAGC GGATACATAT TTGAATGTAT 11308
TTAGAAAAAT AAACAAATAG GGGTTCCGCG CACATTTCCC CGAAAAGTGC CACCTGACGT 11368
CTAAGAAACC ATTATTATCA TGACATTAAC CTATAAAAAT AGGCGTATCA CGAGGCCCTT 11428
TCGTCTCGCG CGTTTCGGTG ATGACGGTGA AAACCTCTGA CACATGCAGC TCCCGGAGAC 11488
GGTCACAGCT TGTCTGTAAG CGGATGCCGG GAGCAGACAA GCCCGTCAGG GCGCGTCAGC 11548 
GGGTGTTGGC GGGTGTCGGG GCTGGCTTAA CTATGCGGCA TCAGAGCAGA TTGTACTGAG 11608
AGTGCACCAT ATGCGGTGTG AAATACCGCA CAGATGCGTA AGGAGAAAAT ACCGCATCAG 11668 
GCGCCATTCG CCATTCAGGC TGCGCAACTG TTGGGAAGGG CGATCGGTGC GGGCCTCTTC 11728 
GCTATTACGC CAGCTGGCGA AAGGGGGATG TGCTGCAAGG CGATTAAGTT GGGTAACGCC 11788
AGGGTTTTCC CAGTCACGAC GTTGTAAAAC GACGGCCAGT GCCAAGCTTG GGCTGCAG 11846

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 211 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
ATTGAACCAA GAAGCTTCTC CCAGGTAAGT TGCTAATAAA GCTTGGCAAG AGTATTTCAA 60
GGAAGATGAA GTCATTAACT ATGCAAAATG CTTCTCAGGC ACCTAGGAAA ATGAGGATGT 120
GAGGCATTTC TACCCACTTG GTACATAAAA TTATTGCTTT TCCTCTTCTT TTTTTCTCCA 180
GAACCCACCA GTCTTGAAAC GCCATCAACG G 211

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 126 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
GTTGGTATCC TTTTTACAGC ACAACTTAAT GAGACAGATA GAAACTGGTC TTGTAGAAAC 60
AGAGTAGTCG CCTGCTTTTC TGCCAGGTGC TGACTTCTCT CCCCTGGGCT GTTTTCATTT 120
TCTCAG 126

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 126 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
GTAAGTATCC TTTTTACAGC ACAACTTAAT GAGACAGATA GAAACTGGTC TTGTAGAAAC 60
AGAGTAGTCG CCTGCTTTTC TGCCAGGTGC TGACTTCTCT CCCCTTCTCT TTTTTCCTTT 120
TCTCAG 126

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
GCCACCAUGG 10

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
AGGTTAATTT TTAAAAAGCA GTCAAAAGTC CAAGTGGCCC TTGCGAGCAT TTACTCTCTC 60
TGTTTGCTCT GGTTAATAAT CTCAGGAGCA CAAACATTCC 100

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 223 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CTTTCTCTTT TCTTTTACAT GAAGGGTCTG GCAGCCAAAG CAATCACTCA AAGTTCAAAC 60 
CTTATCATTT TTTGCTTTGT TCCTCTTGGC CTTGGTTTTG TACATCAGCT TTGAAAATAC 120
CATCCCAGGG TTAATGCTGG GGTTAATTTA TAACTAAGAG TGCTCTAGTT TTGCAATACA 180
GGACATGCTA TAAAAATGGA AAGATGTTGC TTTCTGAGAG ATA 223

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
AGAUCUCGAG AAAGCUAACA ACAAAGAACA ACAAACAACA AUCAGGAUAA CAAGAACGAA 60
ACAAUAACAG CCACCAUGGA AAUAGAGCUC 90